Advanced Methods of 
Pharmacokinetic and 
Pharmacodynamic 
Systems Analysis 




Edited by 

David Z. D’Argenio 




Advanced Methods of 
Pharmacokinetic and 
Pharmacodynamic 
Systems Analysis 
Volume 3 




THE KLUWER INTERNATIONAL SERIES 
IN ENGINEERING AND COMPUTER SCIENCE 




Advanced Methods of 
Pharmacokinetic and 
Pharmacodynamic 

Systems Analysis 
Volume 3 



Edited by 

David Z. D’Argenio 

University of Southern California 
Los Angeles, California 



KLUWER ACADEMIC PUBLISHERS 

NEW YORK, BOSTON, DORDRECHT, LONDON, MOSCOW 



eBook ISBN: 0-306-48523-0 

Print ISBN: 1-4020-7804-8 



©2004 Kluwer Academic Publishers 

New York, Boston, Dordrecht, London, Moscow 

Print ©2004 Kluwer Academic Publishers 
Boston 

All rights reserved 



No part of this eBook may be reproduced or transmitted in any form or by any means, electronic, 
mechanical, recording, or otherwise, without written consent from the Publisher 



Created in the United States of America 



Visit Kluwer Online at: 
and Kluwer's eBookstore at: 



http://kluweronline.com 

http://ebooks.kluweronline.com 



CONTENTS 



Preface 



vii 



MECHANISM-BASED PK/PD 

Physiological Modeling of the Small Intestine in Drug Absorption ... 3 



K. Sandy Pang 

Intracellular Metabolism of Lamivudine 33 

Zexun Zhou, David Z. D'Argenio, Brian L. Robbins and John H. Rodman 

Mechanism-Based Pharmacodynamic Modeling for Prediction 

of Exposure Response 49 

Meindert Danhof, Piet H. van der Graaf, Sandra A.G. Visser and 
Klaas P. Zuideveld 



Mechanism-Based PK/PD Modeling of Digoxin: Role of Receptor 



Heterogeneity and Postrecepotor Events 67 

Michael Weiss and Wonku Kang 

Pharmacodynamics and Pharmacogenomics of Coiticosteroids: 

Microarray Studies 85 

William J. Jusko, Jin Y. Jin, Debra C. DuBois and Richard R. Almon 

Bayesian Inference in Physiologically-Based Pharmaco ki netic 



Modeling: Application to Anticancer Drug Development .... 105 
Lu Xu, David Z. D'Argenio, Julie L. Eiseman and Merrill J.Egorin 



PHARMACOMETRICS 

Monte Carlo Parametric Expectation Maximization (MC-PEM) 
Method for Analyzing Population Pharmacokinetic/ 



Pharmacodynamic Data 135 

Robert J. Bauer and Serge Guzy 

Application of Markov Chain Monte Carlo Methods to 

Inverse Problems in PK/PD 165 

Dongwoo Kang and Davide Verotta 

Regenerative Markov Chain Monte Carlo Methods for 

Pharmacokinetic/Pharmacodynamic Estimation 179 

Dongwoo Kang. Alan Schumitzky and David Z. D’Argenio 




Vizstruct: A Multidimensional Visualization Algorithm for Gene 

Expression 197 

Murali Ramanathan, Li Zhang and Aidong Zhang 



PHARMACOTHERAPY 

Modeling and Simulation of Effects of Adherence, Pharmacokinetic, 
and Pharmacodynamic Characteristics in Development of 
Resistance to HIV Regimens in Treatment-Naive Patients ... 211 
Ann Hsu, Russ Wada, Bill Poland, Eugene Sun and Rick G. Granneman 

Mechanistic PK/PD Modeling of Antiretroviral Therapies in AIDS 



Clinical Trials 221 

Yangxin Huang and Hulin Wu 

Dose Individualization for High-Dose Anti-Cancer 

Chemotherapy 239 

Gary L. Rosner, Peter Muller, Feng Tang, Timothy Madden and 
Borje S. Andersson 

Toward Model-Based Chemotherapy Treatment Design 255 

Robert S. Parker, John M. Harrold, Julie L. Eiseman, 

William C. Zamboni, Erin Joseph, Sandra Strychor and Merrill J. Egorin 

Control of Uncertain Pharmacodynamic Systems 275 

Kyungsoo Park and David Z. D'Argenio 



Contributors 295 

Participants 305 

Index 317 



vi 




PREFACE 



This volume is the third in the series “Advanced Methods of Pharmacokinetic 
and Pharmacodynamic Systems Analysis” sponsored by the Biomedical Simulations 
Resource (BMSR) at the University of Southern California as part of its dissemination 
activities. Through its support from the Division of Bioengineering at the National 
Institute of Biomedical Imaging and Bioengineering at the N1H, the BMSR develops 
novel methods for biomedical modeling and simulation and distributes its knowledge 
and methodologies through its collaborative research projects, as well as via service, 
training and dissemination activities. 

The contributions to this volume are derived from the presentations made at the 
BMSR sponsored 10th Workshop on Advanced Methods of PK/PD Systems Anal- 
ysis held in Marina del Rey, California on June 16 and 17, 2003. The chapters 
are arranged in three sections under the general headings: mechanism-based PK/PD; 
pharmacometrics; and pharmacotherapeutics. The first section includes contributions 
on drug absorption in the small intestine and on cellular metabolism of nucleoside 
analogues, followed by four chapters focusing on in vivo exposure -response modeling 
that include fundamental aspects of receptor theory and cellular signaling pathways 
in PK/PD model development. In the second section on pharmacometrics, new ap- 
proaches are presented for population modeling, deconvolution, Bayesian computa- 
tion, and visualization of gene expression data. The section on pharmacotherapeutics 
includes chapters on modeling resistance in antiretroviral therapy and on modeling 
HIV dynamics during long-term treatment, as well as chapters on deterministic and 
stochastic approaches to dose regimen design in chemotherapy, and on control of 
pharmacodynamic processes. 

I wish to express my gratitude to all the authors for their excellent contributions to 
this volume. These researchers, as well as others who have contributed to our previous 
meetings, are at the forefront of innovation in pharmacokinetics/pharmacodynamics 
and its application to drug development. In addition 1 would like to recognize Marcos 
Briano of the BMSR for his considerable efforts in coordinating the preparation of 
this volume, as well as his skillful administration of all of the activities of the BMSR. 
It is with pleasure that I acknowledge Dr. Grace C.Y. Peng, Program Director of the 
Division of Bioengineering at the NIBIB/NIH, for the guidance and encouragement 
she has provided to the BMSR. 



David Z. D’Argenio 
Los Angeles, California 




MECHANISM-BASED 

PK/PD 




PHYSIOLOGICAL MODELING OF THE 
SMALL INTESTINE IN 
DRUG ABSORPTION 



K. Sandy Pang 

Department of Pharmaceutical Sciences 
University of Toronto 



ORAL DRUG ABSORPTION 

The extent and rate of drug absorption are strongly modulated by the 
physicochemical properties of drug and physiology of the gastrointestinal 
tract (GIT). Additionally, drug dissolution from its dosage form could 
constitute the slowest or rate-determining step in drug absorption [1]. 
The drug needs to leave the aqueous environment to interact with the 
membrane for permeation. Then the drug needs to survive metabolism 
and efflux by the intestine, liver and lung, the first-pass organs, before 
the drug reaches the systemic circulation [2]. These various events may 
lead to significant reduction of the orally administered dose. 

The intestine is an important tissue that regulates the extent of ab- 
sorption of orally administered drugs, and the intestine, liver and lung 
are involved in first-pass removal [3,4]. The intestine is unique in that 
it is the anterior, portal tissue that regulates the flow of substrates to 
the liver, then the lung. The venous drainage of the intestine consti- 
tutes the majority of the blood supply to the liver, accounting for 75% 
of total liver blood flow. The drug concentration [5,6] and the intestinal 
flow rate [7], factors that alter the rate of drug delivery, affect the de- 
gree of saturability of the intestine as well as the liver. For drugs that 
are highly cleared by the intestine, the contribution of the liver or lung 
to drug metabolism will become reduced. By contrast, drugs that are 
poorly extracted by the intestine are able to reach the next first-pass 
organs, the liver and the lung, for removal [8,5,9]. 

This chapter describes the roles of intestinal transporters and metabo- 
lism in the determination of oral bioavailability and first-pass removal. 
The majority of drug absorption occurs at the small intestine instead of 
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the stomach or the large intestine because of the large surface area due to 
the presence of villi and microvilli that increase the surface area many- 
fold. Both the duodenum and jejunum possess higher surface areas than 
the ileum [10]. 



PHYSICOCHEMICAL PROPERTIES OF DRUGS 

There has been much effort devoted to relate the physicochemical 
properties of drug with oral drug absorption . The drug, whether a 
weak acid or weak base, the pKa [11-14], the microclimate pH [15], 
the unstirred water layer, USWL [16,17], solubility, dose, and fraction 
unionized are factors that impact on the fraction of dose absorbed [18— 
20]. The concept of an absorption potential has been utilized to describe 
drug absorbability based on the partition coefficient. It is generally ac- 
cepted that drugs that are unionized or that undergo hydrogen bonding 
exhibit a much greater lipophilicity towards membrane pemieation than 
their ionic counterparts, whereas too many hydrogen donor or acceptor 
groups, however, is not good [21]. Lipophilicity is a major determi- 
nant for membrane permeation, and the extent of absorption is often 
correlated to the partition coefficient, for drugs that enjoy good aque- 
ous solubility when the unstirred water layer is not an imposing barrier 
[22]. Predictive models based on Lipinski’s Rule of Five [21], the Quan- 
titative Structure Bioavailability Relationship (QSBR) [23], or others 
QSAR models [24-27] have been developed to predict drug permeation 
potentials. 



THE INTESTINE 

In addition to mediating passive absorption by the villi and microvilli, 
the small intestine is endowed with anionic and cationic transporters 
[28-35]. These are found within the enterocytes that are located abun- 
dantly at the villous tip (Fig. 1). The various apical transporters for the 
absorption of organic anions and cations have been reviewed [29,33,32] 
(Table 1). The efflux transporters, known collectively as the ABC (ATP 
binding cassette proteins), mediate exsorption at both the apical and 
basolateral membranes [36-38]. Important examples are the 170 kDa 
P-glycoprotein (Pgp, or the multidrug resistance gene product, MDR1) 
[39-42] and the multidrug resistance-associated protein 2 (MRP2) [43- 
45]. Moreover, the newly characterized breast cancer resistance pro- 
tein (BCRP) multidrug transporter confers resistance to mitoxantrone, 
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topotecan, the anthracyclines, and related drugs in cell lines [46-51]. 
Drug efflux back to the intestinal lumen effectively reduces drug absorp- 
tion as well as the sojourn of drug within the enterocyte [52-54]. Drugs 
may also be effluxed into the circulation by MRP3 [55] or the monocar- 
boxylic acid transporter 1, Mctl, [56,57] at the basolateral membrane. 




Fig. 1 . Schematic diagram of the enterocyte of the small intestine showing absorp- 
tive and efflux transporters at the apical and basolateral membranes, and enzymes 
for intracellular metabolism. 



Drug metabolizing enzymes such as the cytochromes [58-64] and Phase 
II enzymes [65,66] are present within the enterocytes. Needless to say, 
the presence of intestinal enzymes reduces drug bioavailability. Many 
drugs are substrates of both Pgp and cytochrome P450 3A [67-69]. Ex- 
amples of these are verapamil [70,71], vincristine, etoposide, daunoru- 
bicin, paclitaxel [72-75,38,76], digoxin [77,78], the HIV protease in- 
hibitor indinavir [79-81], cyclosporin [82,53], tacrolimus [83-85], and 
sirolimus [86,87]. Both Pgp and CYP3A are under regulation of the 
pregnane X-receptor [88], Although the impact of PXR on liver is well 
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Table 1 . Involvement of intestinal transporters for drug absorption. 



Transporter 


Substrate 


| Ref. 


Asbt or ibat 

Ileal sodium-dependent bile acid 
transporter 


Taurocholate 


g 


Amino acid carrier 


D- and L-methionine, 1-methyldopa, 
bacolfen, taurine, alanine 




BCRP 

Breast cancer resistance protein 


topotecan, mitoxanthrone, 
anthracyclines 


HI 


CNT 

Nucleoside transporters N1 (purine) 
N2 (pyrimidine) 


zidovudine, stavudine, thymidine, 
ribavirin, allopurinol, 5-fluorouracil 




Folic acid transporter 


methotrexate 




Peptl 

Oligopeptide transporter 1 


aminocephalosporins, /3-amino 
lactam antibiotics (cyclacillin, 
cephalexin, cephradine) ACE 
inhibitors (benazepril, quinapril), 
bestatin, renin inhibitors 


[28] 


MDR1 or Pgp 

Multidrug resistance protein 1 
or P-glycoprotein 


daunomycin, Rhodaminel23, 
verapamil, diltiazepm, colchicines, 
digoxin, quinidine vincristine, 
etoposide, /3-blockers - sotalol, 
pafenolol, celiprolol, tanilol 




Mctl 

Monocarboxylic acid transporter 1 


lactate and pyruvate transport 
(simple aryl and alkyl acids) 




Mrp2 

Multidrug resistance-associated 
protein 2 


glutathione conjugates, glucuronide 
conjugates, leukotrience C4, 
vincristine 


[44] 

[45] 


Mrp3 




[108] 


NPTl 

Sodium-dependent phosphate 
tranporter 1 


fosfomycin, foscarnet 


[149] 


Oatp3 

Intestinal organic anion 
transporting polypeptide 3 


Taurocholate, estrone sulfate 


[55] 


OCT1 

Organic cation transporter 1 


spermine, spermidine 




Peptl 

Oligopeptide transporter 1 


aminocephalosporins, /3-amino 
lactam antibiotics (cyclacillin, 
cephalexin, cephradine) ACE 
inhibitors (benazepril, quinapril) 
bestatin, renin inhibitors 


[28] 
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recognized [89], less is known regarding the effect of PXR on drug trans- 
port and metabolism within the intestine. 



HETEROGENEITY OF INTESTINAL 
TRANSPORTERS 

Analogous to that found for the liver [5,90,91], there exists an increas- 
ing body of literature on the heterogeneity of intestinal transporters (Ta- 
ble 2). Although the absorption of some drugs - salicylate [22], antipyrine 
[92], acetaminophen [93], griseofulvin [94], (-)-carbovir [154], and meto- 
prolol [155] - was similar among all the intestinal segments, variations 
in absorption were found for other drugs among the segmental regions. 
The extents of absoiption of ranitidine [95] and talinolol [96] were higher 
in the proximal intestine. Preferential absorption in segmental regions 
was noted for the intestinal absoiption of ranitidine [156] and diltiazem 
[97]. For hydrochlorothiazide, atenolol, furosemide and cimetidine [98], 
the net mucosal to serosal absorption was greater for the jejunum than 
for the ileum. For verapamil [99], phenytoin, almokalant, gemifibrozil, 
metoprolol, omeprazol, propranolol, foscamet, erythritol, dDVAP [22], 
and etoposide [100], net mucosal to serosal absorption was greater for 
the ileum over the jejunum. 

Recent advances on the detection of immunoreactive proteins have 
provided further definitive proof on variations in segmental distributions 
of the intestinal transporters [28,101,45,55,102]. The studies revealed 
heterogeneity of apical absorptive and secretory transporters among seg- 
ments. The faster absorption rate constant of benzoic acid for jejunal 
administration in the perfused rat small intestine preparation correlated 
well with the higher distribution of the Mctl (the monocarboxylic acid 
transporter 1) among jejunal enterocytes in comparison to those in duo- 
denal or ileal segment [103] (Fig. 2). The proton-coupled oligopeptide 
transporter 1, Peptl, of the rabbit was more abundant in the proximal 
intestine (duodenum andjejunum) [28]; the rat organic anion transporter 
3, Oatp3, is higher in the jejunum [55]; the apical bile salt transporter 
(Abst) predominates in the distal ileum of the hamster and rat [104,101]. 
Gotoh et al. [44] demonstrated the greater mRNA expression of Mrp2 in 
the rat jejunum, followed by the duodenum and ileum, with very little in 
the colon. The same was found by Mottino et al. [45]. The excretion of 
the glutathione conjugate 2,4-dinitrophenyl-S-glutathione by Mrp2 was 
greatest in the jejunum [44]. By contrast, the Pgp efflux pump is higher 
distally at the jejunum and ileum [39,99,105,53,106,74,107,81] (Fig. 2). 
The rat basolateral Mrp3 that transports drug out of the cell is highest 
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Table 2. Segmental distribution of transporters and enzymes among the intestinal 
segments. 



Transporter/Enzymes | Segmental Distribution | References 

Transporters 



Asbt 


distal intestine > proximal intestine 




101] [150] 


CNT 


highest in jejunum 




102] 


Mctl 


duodenum < jejunum > ileum 




57][103] 


Mdrl 


jejunum ~ ileum > duodenum 




70] [106] [81] 


Mrp2 


duodenum ~ jejunum > ileum 




44] [45] 


Mrp3 (basolateral) 


ileum > jejunum > duodenum 




108] 


Oatp3 


highest in jejunum 




55] 


Peptl 


proximal intestine > distal ileum 
function: (jejunum > ileum > duo- 
denum) 




28] 



Enzymes 



CYP3A 


duodenum ~ jejunum > ileum 




62] [81] 


Estrone sulfatase 


proximal > distal 




l5lf 


GST 


duodenum ~ jejunum > ileum 




111] 


PST 


duodenum ~ jejunum > ileum 




109] 


UGT 


duodenum ~ jejunum > ileum 




152] 



towards the distal ileum and colon [108]. The heterogeneity of trans- 
porters among the intestinal segments (Table 2) is expected to affect 
drug absorption and bioavailability. 



HETEROGENEITY OF INTESTINAL ENZYMES 
FOR METABOLISM 

The intestinal tissue is endowed with phase I and II enzymes, usually 
at lower levels compared to those for the liver. Drug metabolizing en- 
zymes: UDP-glucuronosyltransferases (UGT), sulfotransferases (PST), 
and glutathione S-transferases (GST) exhibit a decreasing gradient along 
the intestinal wall, from duodenum to ileum [109-111,59] (Table 2). The 
distribution of the human intestinal CYP3A4 paralleled that of the rat 
for the small intestine, and showed a slightly lower level at the duodenum 
before levels rise again at the jejunum, then finally decreasing towards 
the ileum [62,64,58,81]. 
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Mctl 

Mdrl /Pgp 




SI 12 S3 S4 SS S6 S7 S8 




Fig. 2. Heterogeneous distributions of the immunoreative proteins, Mctl and Pgp, 
among enterocytes harvested according to [147], from segments 1 to 8 (SI to S8) of 
the rat, small intestine. Segment 1 is the duodenum; segment 2 is the jejunum of 
equal length to the duodenum that immediately follows; segments 3 to 8 are of equal 
length from the remaining intestine. 



ROUTE-DEPENDENT INTESTINAL 
METABOLISM/EXCRETION 

The phenomenon of route-dependent intestinal metabolism or excre- 
tion has been observed (Table 3). Route -dependent intestinal removal 
describes a greater intestinal metabolism/excretion of drug upon oral or 
luminal dosing vs. systemic dosing. In studies pertaining to the per- 
fused, rat small intestine preparations, greater extents of metabolism 
were noted for acetaminophen [157], enalapril [158] morphine [112] and 
(-)aminocarbovir, the prodrug that was converted to (-)carbovir [113], 
when these were given luminally, whereas metabolism was either absent 
or negligible when the drug dose was given into the reservoir for systemic 
delivery. The results from the perfused vascular intestinal preparations 
mirrored the observations on midazolam hydroxylation in humans. The 
drug exhibited a low intestinal extraction ratio (0.09) in anhepatic pa- 
tients undergoing liver transplantation with systemic administration, 
but extensive first-pass metabolism was noted orally (extraction ratio 
of 0.43), with much of the intestinally formed primary metabolite, 1’- 
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hydroxymidazolam, being detected in the hepatic portal blood [62-64]. 
Also, the erythromycin breath test given intravenously only correlated 
to liver but not intestinal CYP3A4 levels [114,115], and the observation 
translates to inaccessibility of systemically administered erythromycin to 
the intestinal mucosal (CYP3A4) enzymes. Analogously, pre-treatment 
of humans with rifampin, a PXR ligand, on Pgp secretion exerted an ef- 
fect only on the oral but not intravenous kinetics of digoxin [78], suggest- 
ing that the intestinal Pgp is accessible for digoxin administered into the 
lumen. The above findings infer the enzymes for preabsorptive intesti- 
nal metabolism/efflux are always present in enterocytes facing the lumen 
and are unavailable to drugs in the circulation. These observations are 
consequences of route-dependent intestinal metabolism/excretion [116]. 



Table 3. Route-dependent intestinal metabolism. 



Drugs 


Route of Administration 


References 


Intraduodenal 


Systemic 1 


acetaminophen 


glucuronide/Sulfate 


ND 


[157] 


(-)aminocarbovir 


(-)carbovir 


less 


[113] 


cyclosporin 


+ 


ND 


[153] 


enalapril 


enalaprilat 


ND 


[158] 


midazolam 


1-hydroxymidazolam 


less 


[62] 

[64] 


morphine 


morphine 3/J-D-glucuronide 


ND 


[112] 


tacrolimus 


+ 


ND 


[83] 



GASTROINTESTINAL MOTILITY 

The presence of interacting drugs and food affects the transit times 
within the gastrointestinal tract [117,118]. Variations in gastric emp- 
tying rate bring about modulations of intestinal drug absorption. In- 
evitably, a delay in stomach emptying reduces the rate of drug absorp- 
tion since the delivery rate to the small intestine is prolonged. Acid labile 
drugs will endure a greater degradation upon prolongation of stay in the 
stomach, thereby diminishing the extent of absorption [119,120]. How- 
ever, increasing the sojourn of compounds in the stomach will promote 
dissolution and improve the extent of absorption of relatively insoluble 
drugs such as griseofulvin and phenytoin [121,122]. But no change is 
anticipated for drugs of good water and lipid solubility [119,123]. For 
drugs whose intestinal transport is dependent on apical transporters, the 
reduced and intermittent release of drug from the stomach to the small 
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intestine is expected to bring about a desaturation of the transporter 
system, rendering increased extents of absorption [124]. 



INTESTINAL MODELING IN VITRO AND IN 
SMALL INTESTINE 

Intestinal secretion is often studied in the Caco-2 cell culture system, 
where drug may be applied to the apical side (ap) or basolateral (baso) 
side (Fig. 3). A greater apical to basolateral flux (A to B) over that 
of B to A suggests involvement of Pgp [70]. Upon addition of CYP3A4 
to the system, the dynamic interactions of metabolism and secretion 
may be estimated. Recent in vitro studies have led to the conclusion 
that intestinal metabolism could be enhanced when the substrate is se- 
creted by P-glycoprotein due to an increase in the mean residence time 
(MRT) of drug in the intestine [125,126]. Theoretical examinations of 
the Caco2 cell culture system did support the notion that the MRT 
was increased with secretion, but drug metabolism was in fact decreased 
with secretion under linear conditions (Fig. 4) [127]. Under nonlinear 
metabolism, however, instances existed whereby the metabolite forma- 
tion rate may increase even though the ultimate amount of metabolite 
formed remained equal to the dose [127]. Increased secretion and rapid 
re-absorption of drug from the apical compartment to the cell evoked 
desaturation of intestinal enzymes and promoted greater rates of drug 
metabolism. 

The modeling of drug absorption data in the intact small intestine is 
complex, but is often handled simplistically with compartment analy- 
ses for first order or zero order absorption [128,129]. Modeling efforts of 
data in vivo included the gastrointestinal absorption (GITA) and kinetic 
(GITK) models that examine absorption of drug from the stomach, duo- 
denum, upper and lower jejunum and ileum, cecum and large intestine 
with varying transit times and absorption [118]. The important task re- 
mains to provide mechanistically-based modeling so as to allow insight 
into intestinal metabolism, efflux, absorption, and intestinal transit, in- 
cluding route-dependent intestinal metabolism. There had been the- 
oretical investigations on the Compartmental Absorption and Transit 
model (CAT) that considered the transit and absorption of drug in the 
small intestine (7 compartments) and there was no absorption within the 
stomach and colon compartments. Absorption within the seven mixing- 
tanks in series compartments for the small intestine was governed by the 
gastric emptying rate and different intestinal transit times [130,131,20]. 
Since the CAT model did not include the physical modeling of drug 
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Fig. 3. Schematics of the Caco-2 cell culture system. The system consists of a 
donor, apical compartment, a cell monolayer compartment, and a receiving compart- 
ment, the basolateral compartment. Metabolism occurs from the cellular compart- 
ment (with CLi n t,m<,t as the metabolic intrinsic clearance); secretion occurs at the 
apical membrane with the secretory intrinsic clearance denoted as CLint.se o Only 
the unbound drug traverses across the compartments (denoted by f, with subscripts 
for the apical (ap), basolateral (baso) and cellular (cell) compartments). Drug per- 
meation clearances at the basolateral membrane are denoted as CLdi for influx and 
CLd 2 for efflux; k a is the absorption rate constant (passive and transporter-mediated) 
at the apical membrane. Taken from [148), with permission. 



CO 




O 

< 

J 




Fig. 4. Simulations, based on the Caco-2 cell culture system and linear transport 
and metabolism. Increasing the secretory intrinsic clearance {C Lint, sec) decreased the 
rate of metabolite accrual in the cell (Met ce ti) after administration of the dose into 
apical (A) or basolateral (B) compartment. Note that increasing CLint.sec decreased 
metabolite accrual. Adopted from [148], with permission. 
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dissolution from dosage solid forms or controlled release formulations, 
degradation in lumen, changes in absorptive surface area, absorption in 
the stomach and colon, metabolism in liver, and transporter densities 
for absorption and efflux, a refined and expanded model, known as the 
Advanced CAT or ACAT model was developed, with implementation 
in software, for the prediction of drug absorption [132]. Details such as 
particle size, pH, particle density, and diffusion coefficient were included 
for consideration of drug dissolution and absorption. In another model, 
Ito et al. [133] described intestinal metabolism and secretion, intracel- 
lular drug diffusion and permeation through the basolateral membrane; 
however, the model seemed to lack description of intestinal flow. More- 
over, these models do not relate to drug partitioning, transporters, and 
the phenomenon of route-dependent intestinal metabolism. A greater 
understanding of the interplay will allow sound interpretations on in- 
testinal drug metabolism and transport with respect to other drugs or 
the intake of fruit juices [134-136]. 

Physiological models: Traditional model (TM) and 
Segregated Flow Model (SFM) 

Experiments based on the perfused rat small intestine preparation had 
provided data that led to the understanding of intestinal processes in ab- 
sence of the contribution from other eliminating organs. From modeling 
of these data with physiologically-based models, the developed models 
may be refined and extended to encompass important variables such 
as gastrointestinal transit, metabolism, transport, and efflux [112,137] 
(Fig. 5). The Traditional physiological Model (TM) (Fig. 5A) was de- 
veloped to describe data arising from recirculation of tracer morphine in 
the perfused rat small intestine preparation [112]. In this preparation, 
morphine glucuronidation was observed with dosing of morphine into 
the duodenal lumen and not with administration into the reservoir that 
mimicked intravenous administration (Figs. 6 and 7). 

To addresses the apparent inaccessibility of intestinal enzymes to the 
drug in circulation, the Segregated Flow Model (SFM) was developed 
within our laboratory (Fig. 5B). This model was based on modifications 
of the model of Klippert and Noordhoek [138]. The SFM described a 
small and low portion of the intestinal blood flow to the active enterocyte 
region where the absorptive and exsorptive carriers and metabolic en- 
zymes reside. The remaining, bulk intestinal blood flow perfuses the non- 
absorptive and non-metabolizing regions of the small intestine. These 
regions include the serosa, submucosa, and mucosa, excluding the ente- 
rocyte region [139]. Thus, the SFM model recognizes the subtle demar- 
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Fig. 5. Schematics of the traditional model, TM (A) and the segregated, flow 
model, SFM (B). Note that the entire oral dose passes through the enterocyte region 
for TM and SFM, whereas only partial intravenous dose reaches the entrocyte region 
for the SFM. For TM, the intestinal blood ( Qi ) perfuses the entire intestinal tissue, 
the site of metabolism and absorption from the lumen. For SFM, intestinal blood 
is segregated to perfuse the non-metabolizing ( Q 3 ) and enterocyte-mucosal (Qen) 
regions. The drug (D) in the blood spaces equilibrates with those in the corresponding 
tissue layers. Transfer occurs with intrinsic transfer clearances CLd 1 and CLd 2 for 
TM, or CLdi and CLd2, CLdz and CL,u for SFM. The absorptive, metabolic and 
efflux activities within the villus tips of the mucosal layer are represented by the rate 
constant, ka, and metabolic and secretory intrinsic clearances, CLi n t,m and C Lint, sec, 
respectively. Gastrointestinal transit clearance is denoted by CLgit ■ Adapted from 
[139] , with permission. 



cation of tissue layers and distributions in blood supply [140,141]. The 
literature values for the blood flow to the absorptive enterocyte layer of 
the mucosa vary greatly, ranging from 5% to 30% [142,143,141]. For the 
SFM, drug in the lumen must necessarily enter via the enterocyte region 
before reaching the circulation, whereas drug in circulation is primarily 
channelled to other non-metabolizing, tissular regions. The consequence 
is the greater intestinal metabolism for oral over intravenous dosing, even 
when the small intestine is the only removal organ. The properties of 
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Time (min) 



Fig. 6. Fits of tracer [3H] morphine data from the vascularly perfused recirculating, 
rat small intestine preparation to the TM and SFM, after systemic administration 
of dose into the reservoir (data from [139]). There was a total lack of morphine 
glucuronide formed. Only morphine was excreted into the lumen (and luminal fluid 
that was collected as exudates). Note the superior fit of the data to the SFM. Adapted 
from [139], with permission. 



the models were compared; the predicted trends were generally similar 
for both TM and SFM, although the magnitudes differed since the flow 
to the enterocyte region differed. But the SFM was found superior over 
the TM in describing route-dependent intestinal metabolism of tracer 
morphine glucuronidation in the vascularly perfused rat small intestine 
(Figs. 6 and 7) [139]. 

These physiologically-based models developed for the intestine (TM 
and SFM) also provided the theoretical examination of extents of metabo- 
lism and mean residence time (MRT). Under linear conditions, intestinal 
secretion resulted in reduced intestinal metabolite formation, as con- 
firmed in a recent simulation with the TM and SFM [137]. Both secre- 
tion and metabolism reduced biovailability, F, whereas increasing the 
absorption rate constant, ka, increased F. The MRT was prolonged in 
absence of gastrointestinal transit or luminal metabolism (the kg com- 
ponent in Fig. 5), and clearance by other parallel eliminating organs, 
denoted as CLo, was unimportant (Table 4). With loss of drug from 
lumen (kg component > 0), the MRT was reduced. The conclusions 
agree with intuitive and deductive reasoning since reduction of intracel- 
lular substrate concentration in the intestine accompanies drug efflux 
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• M - reservoir 
O M - lumen + exudate 




Time (min) 



Fig. 7. Fits of tracer [3H]morphine data from the vascularly perfused recirculating, 
rat small intestine preparation to the TM and SFM, after duodenal administration of 
dose into the intestinal lumen (data from [139]). Note that morphine glucuronide was 
formed with luminal administration of morphine. Both morphine and morphine glu- 
curonide were found in the lumen and luminal fluids that were collected as exudates. 
Note the superior fit of the data to the SFM. Adapted from [139], with permission. 



at the apical membrane, yielding lower rates of intestinal metabolism 
[144,145]. Increased absorption neutralizes the effect of intestinal secre- 
tion and increased F, whereas increased secretion and metabolism reduce 
the bioavailability, F [137]. 

Segmental Traditional Model (STM) and Segmental 
Segregated Flow Model (SSFM) 

In order to accommodate the attendant heterogeneities of metabolic 
and transporter activities and to examine their impact on intestinal 
clearance and availability, more advanced models have been developed 
[146]. The intestinal compartments of the TM or SFM were expanded 
into three equal segmental compartments, analogous to the zonal mod- 
eling of the liver [91], in the development of the Segmental, Traditional 
Model (STM, Fig. 8A) and the Segmental, Segregated Flow Model 
(SSFM, Fig. 8B) [146]. Varying distributions of absorptive and secretory 
transporters and metabolic enzymes were expected to result in different 
F’s. Through simulations with the STM and SSFM, the highest and 
lowest values of F were found within sets of absorptive, metabolism and 
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Table 4. Trends in mean residence time (MET) in reservoir (subscripted R) and 
intestinal tissue (subscripted int) for intravenous (IV) and oral (PO) dosing for the 
traditional model (TM) and segregated flow model (SFM) when the metabolic intrin- 
sic clearance (CLint.met) was altered from 0 to 4, 8, 12, 16, and 20 ml/min and the 
secretory intrinsic clearance (CLint,sec) was changed from 0 to 4, 8, 12, 16 and 20 
ml/min (taken from Pang et al,, 2003). 



Cases 


TM 


MIITrjv 


M RTint,IV 


M RTR'Po 


M RTinl'PO 


Case 1 

(k g = 0 min -1 ; 
CLq = 0 or 5 ml/min) 
k a = 1 min ~ 1 
k a = 20 min -1 
k a = 200 min -1 


I 


1 


1 


T 


Case 2 

(k g = 3 min -1 ; 
CLo = 0 ml/min) 
k a = 1 min -1 
ka = 20 min -1 
k n — 200 min -1 


1 


1 


1 


1 


Cases 


SFM 


MRTrjv 


M RTint'IV 


MRTr : po 


MRTint.PO 


Case 1 

(k g = 0 min -1 ; 
CLo = 0 or 5 ml/min) 
k a = 1 min -1 
k a = 20 min -1 
ka = 200 min ~ l 


I 


T 


T 


t 


Case 2 

(k g = 3 min -1 ; 
CLo = 0 ml/min) 
k a = 1 min -1 
k a = 20 min -1 
ka = 200 min -1 


1 


1 


1 


1 



secretory activities that were distributed heterogeneously in segments of 
the small intestine (Fig. 9). The predicted trends were generally similar 
for both STM and SSFM, although the magnitudes differed since the 
flow to the enterocyte region differed. Of note is the strong influence of 
the distribution of the metabolic enzymes along the intestinal length on 
F. 

hi other simulations of the STM and SSFM based on equal activ- 
ities for transport and metabolism among the segments, fast absorp- 
tion was found to counteract the effect of the virtual, peripheral com- 
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Fig. 8. The segmental traditional model (STM) that views the intestine as three 
equal segments (or expanded ith compartments where i = 1, 2, or 3) of TM (Fig. 
5A) with metabolism taking place to form the metabolite (M) in tissue (A), and 
the segmental segregated flow model (SSFM), an expansion of the SFM (Fig. 5B) 
that views the intestine as three segments (or expanded ith compartments where i = 
1, 2, or 3) with low and partial flow to the enterocyte regions (fqQint/ 3 where }q 
is the fraction of total intestinal flow (Qint) perfusing the enterocyte region where 
the enzymes and efflux transporters reside) and bulk flow [(1 - fqjQint/ 3] to the 
“other” noneliminating region (B). Drug (D) in blood equilibrates with that in the 
corresponding tissue with intrinsic transfer clearances CLd \ and CLdi for both STM 
and SSFM. The absorptive, metabolic and efflux activities within the villus tips of the 
mucosal layer are represented by the rate constant, Ar 0 «, and metabolic and secretory 
intrinsic clearances , CLi nt ,meti and C Lint, sect, respectively, for each i ih segment. 
The gastrointestinal transit clearance is denoted by CLgitx- Adapted from [146], 
with permission. 



partment. This reduced the MRT of drug in intestinal tissue, MRT ce u 
[146]. Increases in the metabolic intrinsic clearance within the segments 
(C Li n t, m eti) as well as gastrointestinal transit/clearance ( CLqit\ ) de- 
creased the MRT whereas a greater secretory intrinsic clearance ( CLi n t iS ed ) 
followed by reabsorption increased the MRT ce a [146]. Rapid absorption 
of the secreted species cancelled the effect of intestinal secretion and in- 
creased the F (Fig. 10), whereas high metabolic ( CLi n t, m eti ) an d secre- 
tory (C Lint,, sect) intrinsic clearances within the segments as high CLgitx 
reduced F (Fig. 11). These findings mirrored those found for the TM 
and SFM [137]. 







19 



CL int .meti 
(ml/min) 



(min' 1 ) 

CL intiseci 

(ml/min) 

Fig. 9. Simulations of the STM and SSFM, with even distribution of transporter 
and metabolic activities within the three segments. Segmental distributions (des- 
ignated as ith segments where i = 1, 2, and 3) of the metabolic intrinsic clear- 
ance, CLi„t,meti(0.9,0.5, and 0.1 ml/min), the absorption rate constant, fc 0 *(5, 3, 
and 1mm -1 ), and the secretory intrinsic clearance, CL intt , ec i( 2,4, and 6 ml/min) 
furnished the lowest bioavailability (F) (left panel) and the highest F (right panel, 
C Lint t seci were 2, 4, and 6 ml/min, CLi nt ,mtu were 0.1, 0.5, and 0.9 ml/min, and 
kai were 5, 3, and 1 min -1 ). The predicted difference existed in the descending vs. 
ascending, segmental distributions of the metabolic intrinsic clearance, 
within the segments 1, 2, and 3. Taken from [146], with permission. 




CONCLUDING REMARKS 

Model development of drug metabolism and transport by the intes- 
tine is at the infancy stages. The efflux/absorptive transporters operate 
in opposite directions at the apical membrane, with intestinal motil- 
ity removing the drug in lumen to out of the body, thereby preventing 
reabsorption and metabolism. The peculiarity in intestinal blood flow 
to the active metabolic and absoiptive further brings about the unique 
aspect of route-dependent intestinal metabolism/excretion [139]. The 
overall estimate of bioavailability is therefore, the result of membrane 
peimeation by different arrays of transporters operating in the same or 
opposite directions, and preferential flow to cellular regions that con- 
tain the enzymatic activities. Consequently, good correlation between 
in vitro and in vivo data for intestinal drug metabolism/removal would 
not be easy. 

The present modeling approach had omitted the consideration of drug 
dissolution kinetics. It is envisaged that modeling will be more complete 
and improved upon coupling of the drug-release phase of the ACAT 
model [132]. The intermittent release of drugs in various aggregated 
and de-aggregated forms from the stomach, gastric emptying, bile salt 
effects, and the presence of mucus may need to be considered. Proof of 
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(A) 




(B) 




Xr ai (min' 1 ) 



Fig. 10, Simulations performed for the STM (A) and SSFM (B) to demonstrate 
the effect of the segmental, gastrointestinal transit clearance (CLg/t< = 0.01 or 0.1 
ml/min), the secretory intrinsic clearance (CLint,seci), and the absorption rate con- 
stant (kai) on the intestinal (systemic) bioavailability, F. The metabolic intrinsic 
clearance, CLi nttme ti , was kept at 0.1 ml/min; the partitioning clearance of drug, 
CLdi and CLd 2 , were 0.9 ml/min. All the intestinal segments (i = 1, 2, and 3) 
were assigned to be equal and contained equal amounts of transporter, metabolic, 
and absorptive activities. Adapted from [146], with permission. 
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(A) 




(B) 




Fig. 11. Simulations performed for the STM (A) and SSFM (B) to demonstrate 
the effect of the gastrointestinal transit clearance (CLg;t« = 0.01 or 0.1 ml/min), 
the secretory intrinsic clearance ( CLi nt ,, e ci ), and the metabolic intrinsic clearance 
(CLint.mcti) on the intestinal (systemic) bioavailability, F. The absorption rate con- 
stant for the each segment, k a i, was kept at 3 min -1 ; the partitioning clearance for 
drug, CLd i and CLd 2 , were 0.9 ml/min. All the intestinal segments (i = 1, 2, and 
3) were assigned to be equal and contained equal amounts of transporter, metabolic, 
and absorptive activities. Adapted from [146], with permission. 
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the model relies not only on drug but also metabolite disposition during 
both oral and systemic dosing. Needless to say, the liver should be 
properly considered for first-pass removal and systemic bioavailability. It 
is expected that the consolidation of physiologically-based intestinal and 
liver models will greatly improve our predictiveness on drug absorption, 
bioavailability and intestinal metabolism. The challenge remains in the 
scale-up of the conceptual frameworks developed for the TM, SFM, SFM 
and SSFM. The physiological parameters for the flow rate and volume, 
as well as more appropriate intestinal transit times need to be properly 
addressed for modeling of human absorption. 
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INTRODUCTION 

Lamivudine (3TC) is a nucleoside analog that inhibits the replication 
of the human immunodeficiency vims type I (HIV1) in vitro and in vivo 
[1,2]. As with other nucleoside analogs (e.g. zidovudine-ZDV) , 3TC 
must be metabol iz ed to its triphosphate by intracellular kinases to in- 
hibit viral DNA synthesis [3]. Nucleoside analog triphosphates inhibit 
the viral enzyme reverse transcriptase, which is responsible for catalyzing 
the formation of complimentary viral DNA using the host cells endoge- 
nous nucleosides as substrates. Viral replication is inhibited as a result 
of 1) the competitive binding of the endogenous and analog nucleoside 
triphosphates to reverse transcriptase binding sites, and 2) the resulting 
chain termination that occurs following incorporation of the nucleotide 
analog into proviral DNA [4,5]. 

The transport of nucleoside analogs (including 3TC) into and out of 
cells and their phosphorylation by cellular kinases is complex process 
that is subject to considerable variability. A better quantitative under- 
standing of the relation between the plasma concentration of 3TC and its 
intracellular triphosphate may provide a basis for more efficient therapy 
with this nucleoside analog. 

This chapter reports on a model for cellular phosphorylation of 3TC 
developed from in vitro experiments using human peripheral blood mono- 
nuclear cells (PBMC), as well as model predictions of the ki netics of 
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3TC phosphorylation in vivo. The in vitro experiments include both 
resting and stimulated PBMC exposed to two concentrations of 3TC. A 
model for intracellular phosphorylation is developed that incorporates 
the known enzymes of the cytidine pathway, which are thought to be 
responsible for 3TC phosphorylation [61 . The model developed from in 
vitro PBMC is then used to predict cellular phosphates measured in 
PMBC of subjects undergoing anti-retroviral therapy as part of a Pedi- 
atric AIDS Clinical Trial Group study. 



METHODS 
In Vitro Studies 

Blood from healthy subjects was used to isolate peripheral blood 
mononuclear cells (PBMC) by centrifugation over ficoll hypaque. The re- 
sulting mononuclear cells were suspended to a density of 1 x 10 6 cells per 
ml in fresh growth media. The PBMC suspensions were kept overnight 
at 37 °C in a CO 2 incubator and the non adherent cells (primarily lym- 
phocytes) were split into separate fractions and incubated for 1, 2, 4, 
16 and 24 hours with unlabeled 3TC and with [ 3 H] 3TC. Two sets of 
experiments were conducted in which the cells were exposed to either 1 
or 5 3TC with 0.5 and 2.5 /xCi/ml label respectively, maintaining a 
specific activity of 0.5 /iCi//dVl. Nucleotides were extracted from PBMC 
with 200 fil per 10 7 cells of 70% methanol buffered to pH 7.4 with 0.015 
M tris (final concentration) and kept at - 20 °C until analysis. 

An additional set of experiments was conducted in phytohemagglu- 
tinin (PHA) stimulated cells to increase kinase activity. For some nucle- 
oside analogs (e.g., ZDV), the metabolism in PBMC in vivo is similar 
to that of stimulated cells in vitro. In the stimulated cell experiments, 
the cells were activated with PHA 5 pg/ml and 10 U per ml IL-2 for 72 
h at 37 °C prior to exposure to 3TC at either 1 or 5 fi M as described 
above for resting cells. 

Lamivudine metabolites were separated by HPLC using a 250 mm 
Whatman Partisil 10 SAX anion exchange column. The separation in- 
volved a series of linear gradients from the low salt buffer A (5 inM 
ammonium phosphate at pH 4.0) to the high salt buffer B (600 mM 
ammonium phosphate at pH 4.0) at a flow rate of 1.5 ml/min. The gra- 
dient was 10 min at 0-30 %B, 10 min at 30-40 %B and finally at 22 min 
at 100 %B. Fractions were collected at 40 second intervals with 9 ml of 
scintillation cocktail added and the fractions counted on a Beckman LS 
6000 scintillation counter. 
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Clinical Studies 

Study subjects were all 12 to 24 years of age with a confirmed diag- 
nosis of HIV- 1 infection (two positive assays from two different samples) 
and a CD4 count greater than 250 cells/^1 (Pediatric AIDS Clinical 
Trials Group PACTG 1012). Subjects received at least four weeks of 
therapy with ZDV and 3TC (as individual drugs or Combivir) prior to 
the study, with no intention to change any component of the pre-entry 
antiretroviral regimen during the course of the study. Some subjects 
included concurrent therapy with protease inhibitor and non-nucleoside 
reverse transcriptase inhibitor. 

Study subjects were ad mi nistered Combivir (ZDV 300 mg/3TC 150 
mg) on the usual BID schedule for one week and were assessed to have 
70% or greater adherence. Subjects were then switched to a QD regimen 
for a period of seven days before pharmacokinetic assessment. Blood 
samples were obtained immediately prior to the day eight dose, and at 
2, 4, 6, 12 and 24 hours after the dose. 

Samples from 27 subjects were used for intracellular pharmacokinetic 
analysis, 24 of whom had evaluable plasma data. For each blood sam- 
ple, 3TC plasma concentration and PBMC concentration of intracellular 
3TC and its mono-, di- and triphosphate, along with the chol in e anabo- 
lite of 3TC diphosphate. The measurements were performed using a 
modification of the procedure used for ZDV described in [6]. 

Modeling of 3TC Metabolism from In Vitro Data 

Information on the known mechanism of transport and metabolism 
of 3TC was used to construct an initial pathway model for the cellular 
uptake and phosphorylation of 3TC in PBMC. Exploratory analysis of 
the raw data from the resting and stimulated experiments was used to 
identify nonlinear metabolic steps as well as any metabolites undergoing 
rapid equilibrium. Data from the 1/zM and 5^xM resting cell experiments 
were then combined and used to estimate the parameters of various can- 
didate models defined from the pathway model. Model estimation was 
performed by the ADAPT pharmacokinetic/pharmacodynamic systems 
analysis software [7] using maximum likelihood estimation with linear 
error variance measurement structure (slope and intercept estimated). 
The Akaike Information Criterion (AIC), estimated parameter variances, 
and model fit were used to guide model selection. Si mi lar modeling 
analysis was performed using the data from the stimulated cell experi- 
ments. It was assumed that the pathways and processes involved in the 
transport and metabolism of 3TC in PBMC remain the same for both 
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stimulated and resting cells, but that the rate processes could change 
due to stimulation. 

Prediction and Modeling of Clinical Study Data 

As part of a preliminary analysis of in vivo intracellular 3TC metabo- 
lism, the plasma 3TC concentration-time data from 24 subjects were 
pooled and fitted to an exponential model. The fitted curve was then 
used as the extracellular input to predict the intracellular 3TC metabo- 
lites in the subject population. Since most of the PBMC in patients are 
resting cells, the model derived from the in vitro resting cell data was 
used in this analysis. The predicted results were compared to the mean 
and standard deviations of the intracellular metabolite data measure at 
each of the six time points in the 27 patients for which intracellular 
measurements were available. 



RESULTS 

Model of 3TC Metabolism from In Vitro Data 

Figure 1 shows the pathways and intermediates for 3TC transport 
and metabolism. Hydrophilic nucleoside analogs are transported across 
the cell membrane mainly by functional transporters. Transport of nu- 
cleoside analogs such as 3TC is achieved through human equilibrative 
nucleoside transporters, which translocate 3TC down its concentration 
gradient (steps 1 and 2 in Fig. 1) [8]. Phosphorylation of 3TC to 
3TCMP and dephosphorylation of 3TCMP (steps 3 and 4) are both 
catalyzed by the enzyme deoxycytidine kinase [9, 10], while phosphory- 
lation of 3TCMP to 3TCDP and dephosphorylation of 3TCDP (steps 
5 and 6) are catalyzed by CMP/dCMP kinase [11]. Phosphorylation 
of 3TCDP to 3TCTP and dephosphorylation of 3TCTP (steps 7 and 
8) are mainly catalyzed by 3-phosphoglycerate kinase, but NDP kin ases 
can also catalyze these reactions. For endogenous nucleoside diphos- 
phates the corresponding reactions are catalyzed by NDP kinases, but 
the affinity of these enzymes for 3TCDP is thought to be too low to 
be the main enzyme responsible for phosphorylating 3TCDP in PBMC 
[12-14], 

The reaction governing the formation of 3TCDP-choline (step 9) is 
3TCTP + P-choline ^ 3TCDP-choline + PP, which is catalyzed by P- 
choline cytidyl transferase. In the model shown in Fig. 1 we assumed 
that the concentration of P-choline is constant thus simplifying the re- 
action to a first-order process. Another reaction that may play a role 
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is 3TCTP + P-ethanolamine ^ 3TCDP-ethanolamine + PP; the en- 
zyme catalyzing this reaction is P-ethanolamine-cytidyl transferase [15]. 
This reaction, however, is less significant than the P-choline reaction and 
the HPLC method employed might not be able to differentiate 3TCDP- 
choline and 3TCDP-ethanolamine. The reaction for salvaging 3TCMP 
(step 10) is 3TCDP-choline + D-a, /3-diglyceride ;=± lecithin + 3TCMP; 
it is catalyzed by P-choline glyceride transferase. In the model shown 
in Fig. 1 we assumed that the concentration of D-a, /3-diglyceride is 
constant and simplified this reaction to a first-order process. There are 
also literature reports that the reaction 3TCDP-ethanolamine + D-a, /3- 
diglyceride ?=* phosphatidylethanolamine + 3TCMP might also exist. 
This reaction is catalyzed by P-ethanolamine-glyceride transferase [15]. 
The model of Fig. 1 combines these two salvage pathways into a single 
first order reaction. Finally degradation of 3TCTP and binding to re- 
verse transcriptase and incorporation into DNA (step 11) are modeled 
together as a single first-order process. 



Extracellular 



3TC 



l 




3TC 



Cellular 

3 5 

^ 3TCMP „ > 3TCDP 

4 A 6 




II 

± 3TCTP 



3TCDP-cholinc - 



Fig. 1. Pathway model of 3TC transport and metabolism in PBMC. 



From inspection of the raw data it appears that at 24 hours all the 
metabolites, with the exception of the 3TCDP-choline anabolite, have 
reached their steady state values. If all processes are linear, then at 
steady state a 5-fold increase in the extracellular 3TC concentration 
(1/iM and 5/iM) would result in a 5-fold increase in all the metabolites. 
To assess the linearity of the system, the mean of the concentrations of 
each species at 24 hours are compared in Table 1. While the concen- 
tration of intracellular 3TC increased 5-fold between the 1/rM and 5/rM 
experiments, the increase in the concentrations of all the metabolites is 
considerably less. For resting PBMC, the increase is around 2.1 for each 
metabolite, and for stimulated PBMC the increase is around 2.6 for each 
metabolite. Based on this analysis of the raw data and reports from the 
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literature [9], we concluded that the phosphorylation of 3TC (step 3 in 
Fig. 1) is saturable and has been modeled as Michaelis-Menten reaction. 



Table 1. Mean and standard deviation of 3TC metabolites at 24 hours and the 
ratio of the metabolites obtained from lpM and 5pM experiments. 





Resting PBMC 


1/iM* 


5/rM‘ 


5pM/lpM 2 


3TC(intracellular) 


0.33±0.05 


1.80T0.29 


5.43 


3TCMP 


0.28T0.08 


0.59T0.18 


2.08 


3TCDP 


1.82 ±0.26 


4.00±0.64 


2.21 


3TCTP 


1.61±0.21 


3.41±0.61 


2.12 


3TCDP-choline 


0.64T0.09 


1.32±0.19 


2.05 



Stimulated PBMC 





1/rM 1 


5pM‘ 


5/xM/l/iM* 1 


3TC(intracellular) 


0.57±0.06 


2.77±0.19 


4.84 


3TCMP 


0.51±0.06 


1.38±0.09 


2.69 


3TCDP 


2.16±0.33 


5.68±0.30 


2.63 


3TCTP 


3.02±0.82 


8.28±1.72 


2.74 


3TCDP-choline 


0.21±0.05 


0.52±0.10 


2.52 



1 mean±SD; concentration is pmo// 10® cel Is 

2 ratio of mean, 5pM to lpM. 



Table 2. Ratio of 3TCTP to 3TCDP at each time point. 



Time 

(hours) 


Resting PBMC 


|Stimulated PBMC 


1/xM 


5pM 


1/iM 


5pM 


1 


1.07 


0.58 


1.60 


1.62 


2 


1.01 


0.58 


1.69 


1.61 


4 


0.99 


0.61 


1.72 


1.75 


12 


1.03 


0.97 


1.38 


1.37 


24 


0.89 


0.85 


1.40 


1.46 



Inspection of the raw data also suggests that 3TCTP and 3TCDP are 
in rapid equilibrium. Table 2 shows the ratio of the average 3TCTP and 
3TCDP concentrations at the five time points in each of the experiments. 
Since these ratios change little over time in each experiment, 3TCDP 
and 3TCTP were assumed to be in equilibrium in the model, and their 
ratio was represented by an unknown partition coefficient. The model 
shown in Fig. 2 reflects the simplifications made based on the foregoing 
exploratory analysis of the raw data. 
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Fig. 2. Modified model of 3TC metabolism. For resting cells, AIC=-253, Kdeg— 
3.3xl0~ 8 h _1 , Kmp-dp=181 h _1 , Kdp_mp=24.5 h _1 . For stimulated PBMC, 
AIC = -79, K deg = 1.4xl0 -6 h _1 , with CV= 2.7xl0 6 %, Kjwp_dp=76.9 h _1 with 
CV=149%, Kdp- MP = 19.2 h _1 with CV=156%. 



Using the model in Fig. 2, the maximum likelihood estimates of the 
model parameters were obtained using the combined data (1/iiM and 
5/ixM) from the resting experiments, and also using the combined data 
from the stimulated cell experiments. In both cases the estimate of K 
was less than 10~ 5 , implying that the breakdown of the triphosphate 
was negligible, which is consistent with no measurable catabolites de- 
tected using HPLC. In addition, the values of estimated rate constants 
K mp-DP and K DP-MP were at least a magnitude larger than the other 
rate constants for both resting and stimulated experiments, suggesting 
relatively fast equilibrium between 3TCMP and 3TCDP. 

Based on theses results the model shown in Fig. 3 was then ana- 
lyzed, resulting in the parameter estimates listed in Table 3 along with 
the CV% of the estimates. Figure 4 shows the concentrations of the 
intracellular 3TC and the four metabolites predicted from the estimated 
model obtained using the resting cell data, while Fig. 5 shows the cor- 
responding model predictions obtained from the stimulated cell data. 

Prediction and Modeling of Clinical Study Data 

The exponential model fitted to the average 3TC plasma concentra- 
tion data shown in Fig. 6 was used as the extracellular 3TC concen- 
tration in the model of Fig. 3. Using the resting cell model parameter 
values in Table 3, the steady state concentration time profiles of the 
intracellular metabolites were predicted. Figure 7 shows the predicted 
concentration of 3TCMP, 3TCDP, 3TCTP and the choline anabolite 
along with the measured concentrations (mean and standard deviation) 
of each species. 
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Fig. 3. The final model for transport and intracellular metabolism of 3TC in 
PBMC. For resting cells AIC=-257, for stimulated cells AIC=-82. 



Table 3. Parameter estimates for the model in Fig. 3 for resting and stimulated 
cells. 



Parameter 


Units 


Resting 


Stimulated 


Estimate 


CV% 


Estimate 


CV% 


Ki n 


(pmol / 10 b cells//iM/hr) 


0.70 


11 


3.9 


32 


Kout 


(hr' 1 ) 


2.0 


15 


6.1 


36 


V max 


(pmol/ 10 6 cells/hr) 


2.2 


13 


6.8 


9.7 


K m 


(pmo//10 6 cells) 


0.56 


7.3 


3.1 


13 


K.MP-3TC 


(hr- 1 ) 


3.0 


11 


2.4 


12 


Rdp/a;p 


(-) 


7.3 


7.4 


3.8 


6.9 


Rtp/dp 


(-) 


0.81 


4.9 


1.6 


4.6 


K tp-ch 


(hr- 1 ) 


0.086 


9.0 


0.091 


22 


Kch-mp 


(hr- 1 ) 


0.19 


13 


1.4 


24 



While the in vitro model for 3TC metabolism under predicts the mea- 
sured in vivo concentration of the mono-, di-, triphosphates of 3TC 
somewhat, there is a large under prediction of 3TCDP-choline. One pos- 
sible explanation for the under prediction of the in vivo data involves the 
simplification of the 3TCMP salvage pathway that was implemented in 
the in vitro model. In the discussion describing the pathways shown in 
Fig. 1, it was noted that two salvage pathways are present, the 3TCDP- 
choline pathway that was included in the model and the slower parallel 
3TCDP-ethanolamine pathway that was not. Our measurements may, 
however, include both of these species. While this slower pathway may 
not contribute appreciably to the 3TCMP salvage over the short 24 
hour time frame represented by the in vitro experiments, over the ther- 
apeutic dose regimen period of the clinical studies, the slower 3TCDP- 
ethanolamine pathway may contribute to an elevated baseline of the 
choline and ethanolamine anabolites. To test this idea, the model in Fig. 
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Time (hours) 



Fig. 4. In vitro model predictions and measured data for resting cells. 



3 was modified to include the parallel ethanolamine pathway in which 
the 3TCDP-ethanolamine formation and breakdown rate constants were 
arbitrarily set at 2% and 0.75% of their respective values for the 3TCDP- 
choline pathway as given in Table 3. The resulting model prediction of 
the sum of the choline and ethanolamine anabolites is shown in Fig. 8. 
The predicted values of the mono-, di- and triphosphates are essentially 
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Fig. 5. In vitro model predictions and measured data for stimulated cells. 



unchanged from those shown in Fig. 7. (Also, the simulation of the in 
vitro resting experiment with the modified model shows the same results 
as given in Fig. 4.) 
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Plasma 3TC 




Fig. 6. Exponential model fit to the 3TC mean plasma data: 9, 32 x (exp — 0.26t — 
exp— 1.09t) + 0.18. 



DISCUSSION 

Based on the known mecha ni sms of intracellular phosphorylation of 
the nucleoside analog 3TC, together with in vitro measurements of phos- 
phate concentrations obtained from human PBMC following exposure to 
extracellular 3TC, a model was developed to quantify the kinetics of the 
cellular 3TC phosphorylation pathway. This model was then used in a 
preliminary analysis to predict measurements made in patients under- 
going anti-retroviral therapy. 

The results of the modeling analysis of the in vitro experiments in- 
dicate some differences in the kinetics of phosphorylation between the 
resting and stimulated cells. Results listed in Table 3 indicate that 
stimulation of the cell increases the transport rate constants of 3TC into 
(Ki„) and out of iKout) the cell. This increase, however, can be ex- 
plained on the basis of the increase in cell volume and therefore surface 
area that accompanies stimulation, not on an increase in the intrinsic 
transport mechanisms. Comparing the values of K tp-CH in Table 3 
indicates that the kinetics of formation of 3TCDP-choline are sim il ar in 
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Fig. 7. Prediction of the clinical study 3TC metabolites concentrations. 



resting and stimulated cells. The difference in the values of K cH-mp 
between the resting and the stimulated cells suggests that the ki netics 
of the salvage pathway increases significantly in stimulated cells. This 
explains why in stimulated cells a higher triphosphate concentration can 
yield a lower 3TCDP-choline concentration (compare Figs. 4 and 5). It 
is also of note that in stimulated cells V max is greater in resting cells 
(6.8 versus 2.2 pmol/10 6 cells/hr). Thus for high concentrations of 3TC, 
phosphorylation is faster in stimulated cells than in resting cells. How- 
ever, K m is also larger in stimulated cells compared to resting cells (3. 1 
versus 0.56 pmol/10 6 cells), resulting in faster phosphorylation of 3TC 
in resting cells compared to stimulated cells at lower concentrations of 
3TC. 

The model in Fig. 3 assumes that intracellular phosphorylation of 
3TC occurs in a single homogeneous cellular compartment. Literature 
reports [16, 17] indicate that there are two distinct intracellular dCTP 
pools; one is composed of dCTP synthesized from the de novo pathway 
(used in DNA replication), and the other is dCTP retrieved from the 
salvage pathway (used for DNA repair). Moreover, the phosphorylation 
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3TCDP-choline 




Fig. 8. Steady state choline+ethanolamine anaboljte predictions. 



in each pool is distributed over numerous distinct compartments within 
the cell. Since lamivudine is the analog of deoxycytidine (dCyd), its 
phosphorylation should occur in each of the compartments thoughout 
the PBMC that are associated with the salvage pathway. Due to absence 
of any data on this distributed compaitmentation of the phosphorylation 
process, our model assumes a single compartment and thus the estimate 
kinetic parameters values should be interpreted carefully. 

The model developed from the in vitro data gave reasonable esti- 
mates of the average clinical study data; however a more formal anal- 
ysis is needed to develop a model for the in vivo phosphorylation of 
3TC in the population. In particular, the variability observed in the 
measured concentration of the 3TC phosphates in vivo is considerably 
greater than that predicted by the in vitro model. Numerous factors may 
account for the increased variability in the ki netics of phosphorylation 
observed in the patients undergoing antiviral therapy compared to that 
observed in the human PBMC cell culture experiments, including the 
effect of concomitant ZDV administration [18]. An interaction between 
3TC and ZDV metabolism could be mediated through the NDP kin ases 
that catalyze the last phosphorylation step for both 3TC and ZDV. An- 
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other source of variability that was not addressed when applying the in 
vitro data to the in vivo data is the difference in the expression of nu- 
cleotide transporters between the two experiments. Transporters such 
as MRP4 transport nucleoside analog monophosphates out of the cell. 
Prolonged exposure to nucleotide analogs during anti-HIV therapy may 
induce a higher expression level of the transporters in some subgroups in 
the population, leading to greater intersubject variability in intracellular 
phosphorylation [19, 20]. 
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INTRODUCTION 

Pharmacokinetic/pharmacodynamic (PK/PD) modeling is increas- 
ingly applied in drug discovery and pre-clinical development. Specific ap- 
plications include a) selection of drug candidates with the most favourable 
pharmacokinetic (PK) and pharmacodynamic (PD) properties, b) de- 
velopment, evaluation and validation of (complex) PK/PD models, c) 
development evaluation and validation of biomarkers and d) prediction 
of exposure-response to optimise the design of clinical trials. Not sur- 
prisingly there is a clear trend towards the development of mechanism- 
based PK/PD models with improved properties for extrapolation and 
prediction. A key element in these models is the incorporation of princi- 
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pies from receptor theory and dynamical systems analysis. Specifically, 
mechanism-based PD models contain separate expressions to character- 
ize the drug-receptor interaction (in terms of binding and activation) 
and the transduction process. 

In this chapter the development of mechanism-based pharmacody- 
namic models is discussed with special reference to the incorporation of 
i) receptor theory and ii) dynamical systems analysis. 

Incorporation of Receptor Theory 

To date, pharmacokinetic -pharmacodynamic studies have mostly em- 
ployed empirical models such as the Hill equation to describe drug 
concentration-effect relationships in vivo. A limitation of this prac- 
tice is however that empirical models provide only limited insight in 
the underlying factors that determine the shape and the location of the 
concentration-effect curve. Specifically, the potency (i.e., the EC$ o) of 
a compound depends on both the receptor affinity and the intrinsic ef- 
ficacy. Furthermore the intrinsic activity (i.e., maximal effect E max ) is 
a function of both compound (the intrinsic efficacy) and system (the re- 
ceptor density and the function relating receptor occupancy to pharma- 
cological effect) characteristics (Fig. 1). Classical receptor theory explic- 
itly separates drug- specific properties and system-specific properties as 
determinants of the drug concentration-effect relationship. Therefore we 
have developed mechanism-based pharmacokinetic -pharniacodynamic 
modeling strategies based on concepts from receptor theoiy, that can 
provide insight in the factors that determine pharniacodynamic behaviour 
in vivo [1], Thereby two different approaches have been followed: i) a full 
parametric approach based on the operational model of agonism, which 
utilises a hyperbolic transducer function; and ii) a semi-parametric ap- 
proach which utilises a flexible, monotonically increasing function to 
describe the stimulus-effect relationship. 

Full Parametric Approach: Operational Model of 
Agonism 

Classical receptor theory combines 2 independent parts to describe 
drug action: an agonist-dependent part, incorporating agonist affinity 
and intrinsic efficacy, and a tissue-dependent part, determined by the 
receptor concentration and the nature of the stimulus-response relation- 
ship [2], The basic form of Black and Leffs [2] operational model of 
agonism is based on the observation that most agonist concentration- 
effect curves are hyperbolic in nature. The authors proved algebraically 
that if the law of mass action governs the binding of an agonist to the 
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Fig. 1 . The pharmacodynamic characteristics of a drug (potency, intrinsic activity 
and the slope of the concentration-effect relationship) are not only dependent on 
drug-specific properties (affinity and intrinsic efficacy at the receptor) but also on the 
properties of the biological system. This has important implications for the inter- 
species interpolation of drug effects, Bince the biological system may differ between 
species. Furthermore, various factors (i.e., disease, age, chronic treatment, or other 
drugs) may change the biological system and therefore modify the pharmacological 
response. This can therefore explain inter- and intra-individual variability in drug 
response [1]. 



receptor according to: 

[floi x Ml 

Ka + [A] 

where [A/?] is the concentration of agonist- receptor complex, Rq is the 
total receptor concentration, [A] the concentration of the agonist and K& 
is the dissociation equilibrium constant of AR , that then the function 
relating the concentration of agonist-receptor complex and the response 
can only be a straight line or another hyperbolic. In order to allow for 
receptor reserve, the linear alternative needs to be rejected in favor of 
the hyperbolic transducer function, which can be represented as follows: 




E _ [AR] n 
E m ~ K%+ [AR\ n 



( 2 ) 



where E is the response, E m is the maximum system response, Kg 
is the value midpoint location of the transducer function ([AT?] that 
elicits half-maximal effect) and n is the slope of the transducer function. 
Combining Eqs. 1 and 2 yields an explicit model of agonism describing 
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the pharmacological effect in terms of agonist concentration. 

F = E m xr n x [A] n 

( K a + [A]) n + r" x [A} n W 

where the transducer ratio r is defined as Rq/Ke and is a measure of 
the efficiency of the occupied receptors into the pharmacological effect. 

Leff et al. [3] have shown that the operational model of agonism can be 
employed to obtain estimates of affinity and intrinsic efficacy of a partial 
agonist by comparison with a full agonist. This so-called comparative 
method [4] is based on the idea that for a full agonist the Hill equation 
parameters, intrinsic activity (a) and Hill slope (n#), are identical to 
the operational model parameters E m and n respectively. Therefore, it 
has been shown that when E m and n are constrained to the estimates of 
a and n// for the full agonist respectively, Ka and r for a partial agonist 
can be estimated by directly fitting the concentration-effect data to Eq. 
3. To date the application of the comparative method has been confined 
mainly to in vitro studies. Recently however, the operational model of 
agonism has been successfully applied in a series of in vivo investigations 
on the PK/PD correlations of Ai adenosine receptor agonists [5,6], OP 3 
opioid receptor agonists [7,8] and5-HTiA serotonin receptor agonists [9]. 
Simultaneous analysis of in vivo concentration-effect relationships of dif- 
ferent compounds within the non-linear mixed effects modeling software 
NONMEM, assuming a single unique transducer function, has been a 
key feature in these applications. 

Semi-Parametric Approach to the Incorporation of 
Receptor Theory 

A limitation of the operational model of agonism is that it contains 
a strict assumption on the shape of the transducer function. In the- 
ory a transducer function can take any shape. For this reason a semi- 
parametric approach to the incorporation of receptor theory in PK/PD 
modeling has also been proposed. In this approach the interaction be- 
tween the drug and its receptor is still described by a hyperbolic function 
(Eq. 1), but no specific assumptions are made on the shape of the trans- 
ducer function. The approach is based on Stephensons concept of a 
stimulus to the biological system as result of the drug-receptor interac- 
tion [10]. This stimulus is defined as: 

c _ E x l AR ) _ « * [A] U] 

m [a] +k a y) 

where S is the stimulus to the biological system and e is a dimension- 
less proportionality factor denoting the power of a drug to produce a 
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response in a tissue. An important feature is that the intensity of the 
drug response is not directly proportional to the stimulus. Instead the 
response is assumed to be a function/of the stimulus that is monoton- 
ically increasing and continuous: 

(5 > 

Thus the introduction of the function /dissociates receptor stimulus 
and tissue response as directly proportional quantities. This also elim- 
inates the necessity for the assumption that a maximal tissue response 
requires maximal receptor occupancy. This model was further refined 
by Furchgott [11], who defined the concept of intrinsic efficacy e as 



e = e/Ro (6) 

thereby directly incorporating the receptor density Rq into the model as 
major determinant of the effect. The factor e characterizes the capacity 
of a drug to initiate a stimulus per receptor. Thus e is a strictly drug- 
specific parameter. The incorporation of the term e into Eq. 5 results 
in 



E a 

En lax 



= f(S) = f 



g X [go] [A] 
[A] + K a 



(7) 



The semi-parametric approach to the incorporation of receptor theory 
in PK/PD modeling has been successfully applied in investigations on 
GABAa receptor agonists [12]. 



INCORPORATION OF DYNAMICAL SYSTEMS 
ANALYSIS THEORY 

Mechanism-based pharmacodynamic models, such as the operational 
model of agonism are useful to describe and predict steady-state drug 
concentration-effect relationships. However they do not account for time 
dependencies related to the development and the dissipation of the drug 
effect. As an approach to account for delays between the drug concen- 
tration and the effect, Dayneka et al., [13] have proposed a family of four 
indirect response models on the basis of the following equation 

— ki n kout X R (8) 

where R is a physiological entity, which is constantly being produced and 
eli mi nated in time, ki n is the zero^-order rate constant for production 
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of the physiological entity and is the first-order rate constant for 
its loss. In this model the drug effect is described as stimulation or 
inhibition on the factors controlling either the input or the dissipation 
of drug response in a direct concentration-dependent manner. 

Apart from being able to describe a delay in the pharmacological 
response relative to the drug concentration in plasma, there is a growing 
need for methods to describe and predict complex pharmacological effect 
versus time profiles. Such complex profiles may be observed when drug 
exposure leads to tolerance/sensitisation or when homeostatic feedback 
mechanisms are operative. An example of a model to describe complex 
effect versus time profiles is the so-called push-and-pull model depicted 
in Eq. 9. 

k in X (1 - / (C)) - kont x Rx M 
k m x R- km x M (9) 

where C is the drug concentration, M is a response modifier value and 
k rn is a first-order rate constant for development and dissipation of the 
response modifier. Due to its plasticity the push-and-pull could be suc- 
cessfully applied to describe tolerance to the diuretic response upon re- 
peated administration of furosemide [14]. 

Another example of a PK/PD model describing tolerance is the pre- 
cursor pool model shown in Eq. 10. 



flR 

dt 

dM 

dt 



dPool 

dt 

dM 

dt 



— ki n ki oss X Pool X (1 f (C)) 

— ^loss x Pool X (1 f ((9)) k(ynt x R 



( 10 ) 



where Pool is the precursor pool value and ki OS3 is the first-order rate 
constant for release of the precursor into the central compartment. The 
precursor pool model can conceptually be considered a description of 
a tachyphylactic system. The model has been successfully applied to 
describe the effects of neuroleptic drugs on prolactin balance [15]. 

Attempts to model physiological counter regulatory mechanisms has 
resulted in a series of advanced models describing complex behaviour. 
These models are in part based on the work by Ekblad and Licko [16]. 
An example is the model proposed by Bauer et al., to characterize toler- 
ance to the haemodynamic effects of nitroglycerin in experimental heart 
failure. 



dCflTTG 



- k e i x Cntg 



dt 



v 
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(ICntg 
m x — - — 
dt 

k\ x M x Cntg — k% x M 

Ai 2 x M k e i t 2 x E c (11) 

where ki n is the function controlling the input of nitroglycerin ( NTG ) 
into the system, V is the volume of distribution, fci, k e i and k e i t 2 are 
the first-order rate constants for the generation of mediating factors, M 
is the response modulator values, Ed is the direct drug effect and E c 
is the counter-regulatory effect. In the meantime this type of physio- 
logical counter-regulatory effect models has been successfully applied to 
describe tolerance and rebound to the effects of drugs such as alfentanil 
and omeprazole [18,19,20]. 

The most recent development in the incorporation of dynamical sys- 
tems analysis in PK/PD modeling has been the conceptualization of 
the so-called set-point model [21]. This model was designed to describe 
complex effect versus time profiles of the hypothermic response following 
the administration of 5 -HTia receptor agonists to rats. This model is 
schematically shown in Fig. 2. In this model the indirect physiological 
response model (Eq. 8) is combined with a thermostat-like regulation 
of body temperature. This regulation is implemented as a continuous 
process in which the body temperature is compared to a reference or set 
point temperature ( Tsp ). 

In the model it is assumed that 5 -HTja agonists elicit hypothermia 
by decreasing the set point temperature Tsp according to: 

Tsp = To (1 - f(C)) (12) 

where 7o is the set-point temperature in the absence of any drug. Com- 
bining the indirect physiological response model with the thermostat-like 
regulation therefore yields 

kin ~ kout xT x X 7 

a x (To x 1 — / (C) — T) (13) 

in which X denotes the thermostat signal. In the model described in Eq. 
13 the change in X is driven by the difference between the body temper- 
ature and the set-point temperature Tsp on a time scale that is governed 
by a. Hence when the set-point value is lowered, the body temperature 
is perceived as too high and X is lowered. To relate this decreasing 
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dt 

dX 
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dEd 
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dE c 
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Fig. 2. Systems dynamics model for describing 5 -HTia receptor mediated hy- 
pothermia. The model is based on the concept of an indirect physiological response 
model and takes into account the rate constants associated with the warming of the 
body (kin) and the cooling of the body (kout). The indirect physiological response 
model is combined with the thermostat-like regulation of body temperature, in which 
the body temperature (T) is compared with a fixed reference or set point temperature 
(TSP) at rate a, generating a set-point signal X. The extent to which the set-point 
value decreases is a function of drug concentration f(C), which decreases X by the 
amplification factor [21], 



signal to the drop in body temperature, an effector function X 7 was 
designed, in which 7 determines the amplification. Raising this function 
to the loss term k m t x T therefore facilitates the loss of heat. In Eq. 13, 
body temperature and set-point temperature are inter-dependent and 
a feedback loop is created. A unique feature of this feedback loop is 
that it can give rise to oscillatory behaviour, allowing characterisation 
of the complex hypothermic response versus time profiles that are often 
observed upon administration of 5-HTia agonists. 



EXAMPLES OF INCORPORATING RECEPTOR 
THEORY AND DYNAMICAL SYSTEMS 
ANALYSIS IN PK/PD MODELING 

Ai Adenosine Receptor Agonists 

Adenosine is an endogenous hormone with a wide array of physiolog- 
ical actions. Adenosine exerts its actions through at least four receptor 
subtypes Ai, A 2 a, A 2 b and A 3 receptors [13]. In recent years several 
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more or less specific ligands for the various receptor subtypes have been 
synthesised, which are potentially valuable drug candidates for a variety 
of disorders. A major complication however is lack of selectivity of ac- 
tion, even for compounds that bind selectively to one receptor subtype. 
Specifically, the pronounced cardio-depressant effects have been a major 
impediment to use of Ai adenosine agonists as drugs in diseases of the 
central nervous system and lipid metabolism. As a novel approach to 
overcome this limitation, partial agonists have been designed, which in 
theory display greater organ selectivity [14]. 

To establish a scientific basis for the design of partial Ai adeno- 
sine agonists we have established a mechanism-based pharmacokinetic- 
pharmacodynamic strategy based on the operational model of agonism 
[5,6]. As the first step in this development, the operational model of 
agonism was applied to analyse data on the in vivo concentration-effect 
relationships of a series of N 6 -cyclopentyl adenosine (CPA) analogues 
[15,16,17,18]. By simultaneous analysis on these concentration-effect re- 
lationships, on the assumption of single unique (hyperbolic) transducer 
function, estimates of the affinity and efficacy could be obtained for each 
agonist. Affinity estimates obtained in this manner correlated well with 
pKj estimates for the adenosine Ai receptor in rat brain homogenates. 
In addition, a highly significant correlation was found between the es- 
timates of the in vivo efficacy parameter and the GTP shift (the ratio 
between KiKI in the presence and absence of GTP) (Fig. 3). This im- 
plies that on the basis of the proposed mechanism-based PK/PD model 
both the in vivo affinity and efficacy of adenosine Ai receptor agonists 
can be predicted on the basis of in vitro receptor bioassays. 

The objective of a subsequent investigation in this series was modeling 
of tissue-dependent agonism for the effects of N 6 -cyclopentyl adenosine 
(CPA) analogues on heart rate and lipolysis respectively. To this end 
the operational model of agonism was linked to a physiological indirect 
response model [19,20]. This modeling showed that the in vivo density 
and/or coupling adenosine Ai receptors mediating anti-lipolytic effects 
are approximately 38 times higher compared to the receptors mediat- 
ing bradycardia. Furthermore, the model predicts that it is possible to 
design adenosine Ai receptor partial agonists that produce significant 
inhibition of lipolysis and are completely devoid of cardiovascular effects 
in vivo. 

OP 3 Opioid Receptor Agonists 

Our research on modeling of the effects of synthetic opiates has fo- 
cused on i) the mechanism and quantitation of functional adaptation, 
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Fig. 3. Relationship between in vivo pKa estimates for the effect of synthetic N6- 
cyclopentyl-adenosine derivatives on heart rate and the pKI values for the binding 
at the adenosine A1 receptor in rat brain homogenate in vitro in the presence of 
GTP (panel A) and relationship between in vivo logr estimates for the effect on 
heart rate and in vitro GTP shifts (ratio between apparent affinity in the presence 
and absence of GTP) in the radioligand binding studies (panel B). The in irivo pKa 
and logr estimates were obtained by simultaneous fitting of the operational model 
of agonism to the concentration-effect relationships of 7 synthetic N6-cyclopentyl- 
adenosine analogues. The lines were obtained by linear regression. The results show 
that the in vivo affinity and intrinsic efficacy of synthetic N6-cyclopentyl-adenosine 
derivatives can be estimated on basis of in vitro bio-assays. This allows prediction of 
the in vivo concentration-effect relationship [5]. 



and ii) the extrapolation from pre-clinical animal studies to humans. 
The PK/PD correlation of a series of synthetic opiates (i.e., fentanyl, 
alfentanil and sufentanil) was characterized using quantitative EEG pa- 
rameters as a pharmacodynamic endpoint. Receptor binding was deter- 
mined in vitro on basis of the displacement of [ 3 i7] naloxone in washed 
rat brain membranes. Between compounds, large differences in in vivo 
potency (EC 50 ) but not intrinsic activity (E max ) were observed, despite 
wide differences in the value of the sodium-shift (a measure of intrinsic 
efficacy) in an in vitro receptor preparation. Simulation on the basis of 
the operational model of agonism provided evidence for a large receptor 
reserve in the system [7]. 

Upon repeated administration of alfentanil a parallel shift of the con- 
centration-effect relationship to higher concentrations was observed. Sim- 
ulations on the basis of the mechanism-based PK/PD model showed that 
this could be explained by a 40% loss of functional OP 3 -opioid recep- 
tors [30]. Subsequently we have been able to confirm this experimen- 
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tally on the basis of modeling of the effect “receptor” knock-down by 
pre-treatment with the irreversible OP 3 -opioid receptor antagonist /3- 
Funaltrexamine [8]. The mechanism-based model was applied success- 
fully to predict the in vivo potency of the new synthetic opiate remifen- 
tanil and its active metabolite GR90291 in man on basis of data obtained 
in rats [31]. 

GABAa Receptor Agonists 

Our research on GABAa agonists focused on the incorporation of re- 
ceptor theory and the extrapolation from animals to humans. A specific 
feature of this research was that the effects of two classes of compounds, 
benzodiazepines and neurosteroids, which each bind at a distinct binding 
site of the GABAa receptor complex but share the same transduction 
mechanism, were simultaneously analysed. 

On the basis of a simultaneous analysis of the concentration-EEG 
effect relationships of four benzodiazepines (i.e., flunitrazepam, midazo- 
lam, oxazepam and clobazam), a single unique system specific transducer 
function could be identified, which was described non-parametrically as 
a continuously increasing function with no saturation at higher stimulus 
intensities. In addition, the estimates of the drug specific parameters in 
vivo potency and intrinsic efficacy could be obtained for each of the drugs 
[12]. Interestingly, administration of the synthetic neurosteroid alphax- 
alone revealed an EEG effect versus time profile, which was both qual- 
itatively and quantitatively different. The EEG effect reached absolute 
values, which were much higher than with benzodiazepines. Furthermore 
the EEG effect versus time pattern was bi-phasic. A new mechanism- 
based PK/PD model, featuring a saturable (hyperbolic) function to de- 
scribe the receptor activation process in combination with a bi-phasic 
(parabolic) transducer function, satisfactorily described these observa- 
tions [32]. Application of the proposed model to the EEG effects of 
a series of (semi)-synthetic neurosteroids, confirmed that the parabolic 
transducer function is unique to the biological system, while the hyper- 
bolic function to describe the receptor activation, differs between the 
neuro steroids. This analysis revealed further that differences in the in 
vivo potency between neurosteroids, which were consistent with results 
obtained in an in vitro receptor bioassay [33]. Subsequent analysis of 
the effects of a series of GABAa receptor modulators (including benzo- 
diazepines, cylopyrrolones, and a /3-carboline) on the basis of the pro- 
posed mechanism-based PK/PD model revealed that these compounds 
behave as partial agonists relative to neurosteroids. Again estimates 
of the in vivo potency and the in vitro intrinsic efficacy were obtained, 
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which were consistent with results obtained in in vitro receptor assays 
[34]. In the mean time the proposed mechanism-based model has also 
been successfully applied to demonstrate receptor subtype selectivity in 
the effects of zolpidem [35] and to predict the effect of the combined 
administration of a neurosteroid plus a benzodiazepine [36]. 

5-HTia Serotonin Receptor Agonists 

Our research 5 -HTia on agonists focused on the application of system 
dynamics in PK/PD modeling and the assessment of the in vivo opera- 
tional affinity and efficacy of partial agonists, using the effect on body 
temperature as a pharmacodynamic endpoint. Upon the administration 
of prototype 5 -HTia agonists, complex effect versus time profiles are 
observed, which are both qualitatively and quantitatively different de- 
pending on the intrinsic efficacy of the drug and the administered dose. 
A physiological indirect response model with set-point control was found 
to accurately characterize the complex effect versus time profiles and al- 
lowed estimation of the in vivo potency and intrinsic efficacy [21]. The 
model was successfully applied to derive the in vivo concentration-effect 
relationships of a series of 5 -HTia agonists, revealing wide differences in 
in vivo potency and intrinsic efficacy [21,37,38,39,40]. Analysis of this 
information on the basis of the operational model of agonism [2] allowed 
estimation of the in vivo operational affinity and intrinsic efficacy at the 
5 -HTia receptor. An excellent correlation was observed between the in 
vivo intrinsic efficacy and parameters characterising the intrinsic efficacy 
in an in vitro bio-assay. In contrast, a rather poor correlation was ob- 
served between the in vivo affinity estimate and the corresponding in 
vitro estimates, which can be explained by complexities at the level of 
blood-brain barrier transport. 

The proposed mechanism-based PK/PD model for 5 -HTia agonists 
is of considerable theoretical interest since the incorporation of dynam- 
ical systems analysis provides a unique basis for the development of 
mechanism-based disease progression models. 



CONCLUSIONS 

The above examples illustrate the feasibility and the utility of the 
incorporation of i) receptor theory and ii) dynamical systems analysis 
in PK/PD modeling. They show that mechanism-based models have 
indeed much improved properties for extrapolation and prediction. 

The incorporation of receptor theory in PK/PD modeling facilitates 
the prediction of in vivo concentration-effect relationships on the basis 
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of information from in vitro receptor bioassays as was illustrated for Aj 
adenosine receptor agonists [5]. This provides a scientific basis for the se- 
lection of compounds for clinical development and for lead optimization. 
Specifically, the mechanism-based PK/PD model for the design of par- 
tial agonists with an improved selectivity of action for the anti-lipolytic 
effects of Ai adenosine receptor partial agonists is of considerable inter- 
est for the development of these compounds into drugs for the treatment 
of type-2 diabetes mellitus [6]. 

Another application of the incorporation of receptor theory in PK/PD 
modeling is the prediction of in vivo concentrations-effect relationships 
in man on the basis of information in pre-clinical in vivo animal stud- 
ies. This is illustrated by the successful prediction of the concentration- 
EE G effect relationships of the new synthetic opioid remifentanil and its 
metabolite GR90291 in man [31]. In this specific example the concen- 
tration-effect relationships were nearly identical in rats and in humans, 
indicating that the OP3 opioid receptor functions in a nearly identical 
manner in these species. It should be realised however that this is not a 
prerequisite for the successful inter-species extrapolation of drug effects. 
In theory, inter-species extrapolations are equally feasible if differences 
in the functioning of specific receptors (i.e., the receptor density, the 
efficiency of receptor-effector coupling) have been characterized. 

An important feature of the incorporation of receptor theory is also 
that it allows understanding of intra- and inter-individual variability in 
drug response. This is elegantly illustrated in the studies with alfentanil, 
where it was shown that 40% receptor down regulation by pre-treatment 
with the irreversible OP 3 receptor antagonist /? funaltrexamine results in 
a parallel shift of the alfentanil concentration-EEG effect relationship to 
higher concentrations [8]. A previously proposed in vivo receptor model 
for synthetic opioids readily predicted this shift in the concentration- 
EEG effect relationship [7]. 

To date, receptor theory has been applied in PK/PD modeling to un- 
derstand the shape and location of in vivo steady-state concentration- 
effect relationships and to explain intra- and inter-individual variability 
in pharmacodynamics. An important perspective for future research is 
the development of dynamic receptor models, to account for time depen- 
dencies in the pharmacodynamics caused by slow receptor association/ 
dissociation at the receptor. This involves the estimation of the first- and 
second-order rate constants for receptor association and dissociation. In 
this respect it is important that for certain receptor-drug combinations 
the rate constants for association-dissociation have been shown to be 
slow relative to their pharmacokinetics and/or the rate constants for 
biophase equilibration. 
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The incorporation of dynamical systems analysis in PK-PD modeling 
is important to explain complex effect versus time profiles, which may 
be observed when homeostatic feedback mechanisms are operative. This 
is illustrated in the investigations on the modeling of the hypothermic 
response following the administration of 5-HTia serotonin receptor ag- 
onists, where a set-point model was successfully applied to characterize 
oscillatory behaviour in the pharmacological effect [9]. It was also shown 
that this type of modeling is essential to derive the direct concentration- 
effect relationships at the receptor, revealing wide differences in in vivo 
intrinsic efficacy, which are of critical importance in relation to their 
putative therapeutic indications [40]. 

It is anticipated that dynamical systems analysis will increasingly be 
applied to characterize functional adaptation and tolerance phenomena. 
An important application in this respect will be research into the influ- 
ence of rate of administration on drug response. It is well-established 
that for arteriolar vasodilators (i.e., dihydropyridine calcium antago- 
nists) the haemodynamic effect is strongly influenced by the rate of ad- 
ministration [41]. Recently, it has been shown that this applies also to 
other classes of drugs [42]. This can have major implications for the 
design of optimal (oral) dosage forms. 

Finally, differences in the efficiency of homeostatic feedback mecha- 
nisms can be a major cause of intra- and inter-individual variability in 
drug response (i.e., in aging). At present their role in this respect is 
relatively poorly understood. This underscores the importance of the 
incorporation of dynamical systems analysis in (population) PK/PD in- 
vestigations. 
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INTRODUCTION 

In pharmacology, modeling of drug-receptor interaction is mostly 
based on the traditional steady-state approach of measuring concen- 
tration-response curves that neglect the role of reaction kinetics. These 
methods may be less useful for the analysis of heterogeneous receptor 
systems consisting of subpopulations with different affinities as well as 
for the characterization of postreceptor events. Incorporation of reaction 
kinetics is critical for a better understanding of more complex systems. It 
is apparent that the ki netic properties of receptor binding and transduc- 
tion processes are poorly understood. Although the analysis of drug- 
receptor interaction in vivo using pharmacokinetic /pharmacodynamic 
(PK/PD) modeling in rats (e.g., [1,2] and the chapter by Danhof et 
al. in this volume), represented an important step forward, more com- 
plex mechanistic models that include transcapillary drug exchange, the 
dynamics of drug-receptor interaction and transduction kinetics, for ex- 
ample, can hardly be identified using plasma concentration and response 
data alone. One way to overcome this problem is complexity reduction; 
i.e., the decomposition of the system into subsystems which can be iden- 
tified separately. Since the organs represent the natural subsystems of 
the body, experiments in isolated perfused organs offer an efficient possi- 
bility to develop more detailed PK/PD models that provide insight into 
underlying processes and can be scaled-up from the organ to the whole 
body level [3,4]. 

In this chapter we describe the development of such a concept for the 
cardiac glycoside digoxin [5]. Cardiac glycosides have been used in the 
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treatment of patients with heart failure for about 200 years and it is now 
widely accepted that the cardiac glycosides exert their positive inotropic 
effect through an inhibition of Na + /K + -ATPase (sodium pump) which 
serves as functional receptor for digitalis. In the adult rat heart, the 
underlying heterogeneous receptor system is well characterized at the 
molecular level: it consists of two receptor isoforms, ct\ and 0 . 2 , exhibit- 
ing low (ai) and high ( 02 ) affinity for glycosides, respectively, whereby 
c *2 comprises only 10-25% of the sodium pumps [6,7]. However, the func- 
tional role of these Na + /K + -ATPase isoforms and post-receptor events 
in mediating the positive inotropic glycoside effect is still poorly under- 
stood. We present a model for digoxin pharmacokinetics in the perfused 
rat heart and discuss various pharmacodynamic models, including recep- 
tor interaction and postreceptor events, in terms of experimental results. 



MODEL 

Positive Inotropic Effect of Cardiac Glycosides 

Cardiac glycosides act by binding to the extracellular face of the sar- 
colemmal Na + /K + -ATPase, thereby inhibiting its ability to exchange 
intracellular Na + for extracellular K+ In the rat heart (with rela- 
tively high intracellular Na-I- concentration), the resulting increase of 
[Na + ]j (i.e., the reduction of the Na + gradient) subsequently leads to 
an increased influx of Ca 2+ via the sarcolemmal Na + /Ca 2+ exchanger 
(NCX), which is operating in a Na + -efflux/ Ca 2+ -influx mode. Con- 
sequently, more Ca 2+ will be stored in the sarcoplasmic reticulum to 
be released during subsequent contractions [8,9]. Thus, the binding of 
digoxin to receptors (sodium pumps) initiates a series of dynamic events 
that ultimately lead to an increase in the force of contraction (Fig. 1). 

Cardiac Uptake and Binding Processes 

To describe the pharmacokinetics of digoxin in the rat heart, we have 
developed a physiologically realistic compartmental model. As shown 
in Fig. 2, perfusate flow ( Q ) as well as drug outflow occur in the vas- 
cular space (distribution volume V va »), i.e., the measured outflow con- 
centration Ccmt{t ) = D vaa (t) /V vas is the concentration in the vascular 
compartment. Transcapillary transport between vascular and intersti- 
tial space is described by rate constants k m and ki V and the appar- 
ent permeability- surface area or permeation clearance CL m = k^Vyaa 
(clearance of digoxin from the vasculature) quantifies myocardial uptake 
of digoxin. The binding probability of digoxin in the interstitial space 
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Fig. 1. Digoxin inhibits the Na + /K + -ATPase by binding to its a-subunits and 
via the Na + -gradient-coupled Na + /Ca 2+ exchanger, this increases intracellular Ca 2+ 
availability for contractile proteins. 



(amount Di S ) to two saturable binding sites (i =1,2) is dependent on 
the association rate constants k{ (in units of 1/min/nmol) and concen- 
trations of free membrane receptors which is equal to ( RtotjDRi ) where 
Rtot,i is the unknown amount of available receptor sites and DRi de- 
notes the digoxin - receptor complexes (i.e., amount of bound digoxin), 
i.e., the resulting fractional binding rate Ki(t) = ki[R to t,iDRi(t)\ is time- 
dependent. The rate constants for the dissociation of the bound ligand 
were denoted by (in units of 1/min). As suggested by independent 
experiments with the vascular marker Evans blue (data not shown), a lag 
time to and an additional compartment with volume Vo (in series with 
the capillary compartment) were introduced to account for the delayed 
drug appearance and mixing in nonexchanging elements of the system, 
respectively. (Initial estimates of the parameters to and Vo were obtained 
by fitting the outflow curve of Evans blue (1 min infusion) to a model 
consisting of only one additional compartment representing the vascular 
space.) The corresponding differential equations that describe changes 
in the amounts of digoxin in the mixing, capillary and interstitial com- 
partment as well as the compartments representing the two saturable 
binding sites after infusion of digoxin (dosing = RATE ) at the inflow 
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Fig. 2. PK/PD model of inotropic digoxin action in the rat heart with vascular, 
interstitial and cellular compartments and two distinct sarcolemmal binding sites, Ri 
and R2 (qi and Q 2 receptors) assuming that receptor occupation (DRi) initiates the 
effectuation process. 



side of the heart perfused at flow Q (single-pass mode) are given by Eqs. 

(1-5): 



dP o (t) 
dt 

dPvaa (t) 

dt 

dP is (t) 

dt 



dPR\ ( t ) 
dt 

dPR<i ( t ) 
dt 



- (j|) D 0 (t) + RATE 


(1) 


+ k v ij P V as (t) + ki v Pi s (t) + 


) A (0(2) 


k v iDvas{t) ~ [kiv + k\(Rtot,\ ~ DRi(t)) 

+k<i{Rtot,2 — PR2(t))]Dis(t) 
~{-k-\PRi(t) + k—2DR2{t ) 


(3) 


k\ [ Rtot,i ~ DRi (*)] As (0 - k-iPRi (t) 


(4) 


k2 [Rtot,2 ~ DR 2 (*)] As (t) - k-2DR2 ( t ) 


(5) 



Note that cellular uptake of digoxin can be neglected during the time 
course of the experiment. 
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Kinetics of Receptor Binding and Effectuation 

The data were analyzed by assuming a reversible one-to-one interac- 
tion between digoxin and a single receptor class Ri on the sodium pump 



Dis + RiUDRi, i = 1,2 (6) 

ki 

The dynamics of receptor occupation is described by Eqs. (4) and (5) 
and at steady state we obtain for Dva ^ = — = D1 = 0 in Eqs. 
(2-5) 



p.n R-totjCgs 

ss,i ~ KnAh.JCL,^ + C, 



(7) 



where Kp,i = k-i/ki and Ka,i = 1/Kd,i represent the equilibrium disso- 
ciation and affinity constants, respectively, of the two receptor systems 
(K D>i = DisRi/DRi). Since a mechanistic model of the postreceptor 
events shown in Fig. 1. (higher Ca 2+ availability due to activation of 
Ca 2+ influx via NCX after increase in [Na + ]j) would be necessarily over- 
parameterized, a minimal transduction model, i.e., an ad hoc equation, 
was selected to mimic the observed behavior [10, 11]. Considering initia- 
tion of physiological activity upon binding to the heterogeneous receptor 
system, the pharmacological response E(t) is a function of the number of 
receptors occupied by drug, i.e., ligand-receptor complexes, DR\(t) and 
DR 2 (t). Thus, one can assume that the time course of positive inotropic 
effect is given by two components 



E(t) = <f> l {DR 1 (t)) + <h\DR 2 {t)\ (8) 

According to the operational model of receptor agonism [12], we use 
a hyperbolic function for the transduction of receptor occupancy into 
response <f> (stimulus-effect relationship for each receptor subtype) 



0max,t DRj(t) * e l ^ T 

KDR,i + DRi(t ) * e _t / T 



(9) 



where Kdr,i denotes DRi producing 50% of (t> T nax,i (reflecting the effi- 
ciency of response generation via receptor i) and * denotes the convo- 
lution operation. In contrast to the operational model which only holds 
under steady-state conditions, Eq. (9) is based on the assumption that 
the stimulus is delayed relative to DRi(t) by a mean transduction time 
r (Effectuation Model I). 

Alternatively, we can calculate a total stimulus with delayed response 
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m = 



^ma xER^jt) - t/T 

Kdr + DRz(t) 



( 10 ) 



where DRz(t) is the total functional receptor occupation, i.e., the weighted 
sum of both isoforms 



DR e (t) = pDR x (t) + (1 - p) DR 2 (t) (11) 

where pis the proportion of /2i_ occupancy contribution (Effectuation 
Model II). 

A third possibility that accounts for both saturation and transduc- 
tion time is the ternary complex model originally proposed for signaling 
through G-protein coupled receptors (e.g., (13)), 

DRi ~h Aj ^ DRiAi i — 1, 2 (12) 

where A, is a hypothetical second messenger (intermediate stimulus) and 
the response is proportional to the amount of DRiAi complexes 

E (t) = a\DR\A\ (t) + ( 12 DR 2 A 2 (t) (13) 

and two differential equations have to be added to Eqs. (1-5): 



dDR\A\ (t) 
dt 

dDRiA2 ( t ) 
dt 



h [A t ot,i ~ DR!Ai (<)] DRi ( t ) - U DR^ (*)(14) 
h [Am , 2 ~ DR 2 A 2 (f)] DR 2 (<) - I- 2 DR 2 A 2 (t)(15) 



(Effectuation Model III). 

Note that these empirical models of the effectuation process do not 
explicitly account for the chain of postreceptor events described above 
for the positive inotropic digoxin effect (Fig. 1); more detailed models, 
however, could not be identified on the basis of the available data. Under 
steady-state conditions, Model I (Eqs. (8) and (9)) leads to the following 
hyperbolic concentration-response curve ( E max model) 



Emax, \C S s E mBXi 2C ss 

ECs 0,1 + C aa EC$o,2 + Css 



with parameters 



Km ax, i 

ECso,i 



^max, jRtot,i 

KDR,i + Rtot, i 

E DR,jE D t i{kjy / C Lyj) 

E[)R,i d~ Rfot,i 



(16) 



(17) 



(18) 
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(Note that with a different parameterization we get the same result for 
Model III.) 

The effectuation model greatly simplifies and collapses to a linear 
function of receptor occupation when pump inhibition (rate of receptor 
occupation DRi (<)) is the rate limiting step (i.e., the effect is in phase 
with Dili) and no saturation occurs, i.e., the effect induced by the mix- 
ture of occupied receptors can be approximated by a linear combination 
of DR\ ( t ) and DR 2 ( t ) 



E ( t ) — e\DR\ ( t ) -t- e 2 DR 2 ( t ) (19) 

where the parameter ej is the effect per unit of digoxin-receptor complex. 

Parameter Estimation 

Although the above model structure is already a minimal model that is 
reasonably consistent with the physiological knowledge of digoxin uptake 
and action, the number of parameters to be estimated is relatively large. 
Furthermore, due to the nonlinearity of the differential equations to be 
solved the system may have multiple solutions, and even if there is only 
a single solution, the estimates obtained by nonlinear regression can be 
highly dependent on the initial values. Since the information content of 
the outflow data is inadequate to support such a nonlinear model with a 
relative rich parameterization, we take advantage of the fact that a priori 
information on the unknown model parameters is available, e.g., results 
obtained from in vitro receptor binding experiments of digoxin. The 
Bayesian approach to model identification (e.g., maximum a posteriori 
(MAP) estimation) is a theoretically sound method to incorporate such 
knowledge in probabilistic terms. The system of differential equations 
was solved numerically and parameter estimation was performed with 
the ADAPT II software [14], which contains a MAP Bayesian estimation 
feature. For all pharmacokinetic and pharmacodynamic fits, an additive 
plus proportional intraindividual error model was used. 

The digoxin outflow data were first analyzed to obtain estimates of 
the transport and binding parameters. Those values were then fixed in 
fitting the time course of left ventricular developed pressure ( LVDP ) 
obtained from the same experiments. The LVDP (t) data was used as a 
measure of inotropic response, i.e., the increase in LVDP with respect 
to the baseline (predrug) value LVDPq, 



m = 



LVDP(t) - LVDP 0 (t ) 
LVDP 0 (t ) 



( 20 ) 
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The terminology “effect”, E ( t ), will be used throughout for the frac- 
tional change of LVDP. As noted above, in Bayesian estimation we 
made use of a priori knowledge on the ratios of the receptor affinities 
(K a<2 /Ka,i = 13.7) and capacities ( Rtot,i/Rtot,2 = 8.7) of digoxin ob- 
tained from rat ventricle, microsomal preparations [6] setting the frac- 
tional dispersion of these ratios to 20%. The volume of the vascular 
compartment was fixed for a literature value ( V vas = 0.06 ml/g) taken 
from anatomic data [15]. 

In order to improve the accuracy and reliability of parameter estima- 
tion, three outflow, C ( t ), and three response data, E(t), measured for 
three doses as consecutive 1-min infusions in each heart were simultane- 
ously fitted by a single set of parameter values. This method, also known 
as global or simultaneous nonlinear regression (SNLR), has recently be- 
come practical for fitting nonlinear enzyme -kinetic models to data (e.g., 
[16]). The results are expressed as average ±S.D. of the parameters es- 
timated in 12 hearts with [Ca 2+ ] ei( = 2.5mM (3 doses of 78.1, 156.2, 
and 234.3 pg ) and two groups of experiments (5 hearts in each group) 
with [Ca 2+ ] ex( = 0.5 and 1.5 mM (3 doses of 15, 30, and 45 pg), re- 
spectively. In the latter two groups, experiments were performed under 
control conditions and in the presence of the NCX inhibitor KB-R7943 
(0.1 ^M) in the perfusate. 



RESULTS AND DISCUSSION 

In all control experiments (i.e., without NCX inhibition), the sim- 
plest effectuation model (Eq. (19)) was sufficient to describe the data. 
This is demonstrated in Fig. 3 and Fig. 4 (left) for experiments with 
external calcium concentrations of 2.5 and 0.5 mM, respectively; the 
mechanistic PK/PD model fitted the outflow and response data (three 
consecutive doses in one heart followed by 15-min washout periods) very 
well. The fact that the model can bring six different sets of data into 
a single quantitative description also suggests its usefulness. The model 
was conditionally identifiable and parameters were estimated with rea- 
sonable precision. For an external calcium concentration of 2.5 mM [5], 
cardiac kinetics of digoxin was characterized by transport across the 
capillary barrier with a permeation clearance of = 2.35 ± 0.58 
ml/min/g, which was lower than perfusate flow (8.7 ml/min/g) indicat- 
ing barrier-limited uptake by passive transport through interendothelial 
gaps. The distribution kinetics of digoxin in interstitium was determined 
by specific binding to two distinct extracellular sites, R\ and R 2 , alow 
affinity receptor with high capacity (Rp Kd,\ — 20.9 ±8.4 nmol) and a 
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Time (min) 



Fig. 3. Representative simultaneous fit of the model to PK (upper) and PD (lower) 
data for three digoxin doses of 0.3, 0.2 and 0.1 mol (234.3, 156.2 and 78.1 fig) and 
external calcium concentration of 2.5 mM: Adapted from [5j. 



high affinity receptor with low capacity (i? 2 : Kd , 2 = 1.5 ±0.6 nmol). In 
contrast to conventional PK/PD-modeling, where drug-receptor interac- 
tion does not influence mass balance, specific binding was an important 
determinant of digoxin PK in the rat heart. A similar approach has been 
recently used by Mager and Jusko [17] to describe interferon PK/PD. 
In view of the nonlinearity of the system, our simultaneous analysis of 
concentration-time and response-time curves measured for three doses of 
digoxin offers the opportunity to overcome these difficulties by increasing 
the amount of information. 

That under control conditions the time course of inotropic action of 
digoxin E ( t ) was successfully described by the simple linear model (Eq. 
(19)) as weighted sum of drug-receptor complexes DR\ (t) and DR^ (f), 
whereby the parameters e\ and e<i can be regarded as the respective 
effects per unit of digoxin-receptor complex, suggests that the relatively 
slow receptor binding is the rate-limiting step in response generation and 
that the effectuation process shows no saturation. 
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Fig. 4. Representative simultaneous fit of the model to PK (upper) and PD (lower) 
data for three digoxin doses (15, 30, and 45 fj. g) and an external calcium concentration 
of 0.5 mM under control conditions (left) and in the presence of the NCX inhibitor 
KB-R7943 (right). 

Receptor Heterogeneity 

The present approach allows, for the first time, identification the func- 
tional role of the receptor isoforms, a\ and c* 2 , in the intact rat heart. 
Thus, on this model, we can correlate the physiological response to ai- 
and a 2 -subunit inhibition. Based on the predicted digoxin receptor oc- 
cupancy, DR\ ( t ) and DR 2 ( t ), at the end of infusion, Fig. 5 illustrates 
that the positive inotropy is predominantly mediated by the high affinity 
pumps ( 02 ); while their effect decreases with increasing dose, the con- 
tribution of the low affinity pumps (c*i) increases. To test whether the 
response could be solely mediated by high affinity receptor binding (R 2 ), 
we refitted the data assuming that binding to R\ would not contribute 
to E ( t ) (i.e., e\ = 0 in Eq. (19)). The shape of the best fitted curve 
in Fig. 6 clearly demonstrates that the model then fails to describe the 
high dose (234.3 /rg) response. A significant reduction of the generalized 
information criterion for the MAP estimator [14] indicates that this re- 
sult is also statistically supported by the current data, suggesting that 
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Fig. 5. Model prediction of the contribution of ou receptor (low affinity, high 
capacity) and <*2 receptor (high affinity, low capacity) occupation to the maximal 
effect at the end of infusion. Adapted from [5]. 



the low affinity/high capacity isoform (c*i) significantly contributes to 
the effect at higher dose levels ( P < 0.05). This is also demonstrated 
in Fig. 7 where the equilibrium concentration-response curve predicted 
from the mean parameters estimates of the 0.5 mM Ca 2+ experiments 
is shown together with the underlying components. Although this con- 
secutive inhibition of first <* 2 - and then the ai-isoform of the Na + pump 
with increasing dose is in contrast to the hypothesis that only the inhi- 
bition of the c* 2 -isoform by cardiac glycosides would induce the positive 
inotropic effect [18], the a 2 -isoform (high affinity receptor R 2 ) is char- 
acterized by a higher (~ 2 fold greater) sensitivity of the transduction 
mechanism, e 2 (P < 0.05). This could be explained by the postulated 
colocal iz ation of the c^-receptor and the Na + /Ca + in the sarcolemma at 
sites of restricted diffusion (Fig. 1) since the effects of pump inhibition 
may then be amplified beyond those expected on the basis of the bulk 
cytoplasmic Na + level [19]. 

Postreceptor Events 

As noted above, under normal conditions, the effect was proportional 
to receptor occupancy and the estimates of ei and e 2 determine the 
efficacy of response generation, i.e., the ability of inhibited receptor 
subtypes to licit a positive inotropic effect (Eq. (19)). Both effica- 
cies, e\ and e 2 , increased 9- and 11-fold, respectively, when external 
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infusion infusion infusion 

Time (min) 



Fig. 0. Effect data fitted by the complete model (Fig. 4) (upper) and by a reduced 
model where the effect is solely induced by a-i receptor occupation (lower) for three 
digoxin doses of 0.3, 0.2 and 0.1 ^moi (234.3, 156 2 and 78.1 y.g). Adapted from (5). 



calcium concentration was decreased from 2.5 to 0.5 mM; this increase 
in digitalis sensitivity is also obvious from the predicted steady-state 
concentration-response curves shown in Fig. 8 (Eq. (7)). However, this 
simple model failed to describe the effect-time data in the presence of 
the NCX-inhibitor KB-R7943. Notably, a reasonable fit was obtained 
by all three effectuation models (Models I— III) defined above, neglecting 
the contribution of the low affinity receptor (cvi); only for Model II (Eqs. 
(10) and (11)) the inclusion of oj increased the AlC-value, however, the 
parameter p could not be estimated with sufficient reliability. Because 
there were no significant differences among these models regarding their 
ability to fit the response data, we used Model I ($2 in Eqs. (8) and (9)) 
in the case of NCX inhibition. Although this model well captured the 
major features of the data, it failed (like the others) to fit the peak of 
the curve (Fig. 4, right). The modeling of the data obtained in the pres- 
ence of KB-R7943 indicates that NCX-inhibition leads to a saturated 
and delayed effectuation process. This is in accordance with the role of 
NCX in the transduction of digoxin receptor occupancy into the positive 
inotropic effect: when the efficiency of Ca ++ transport into the cell via 
NCX decreases this process may become saturated and rate -limiting. 
Furthermore, due to the lower affinity and efficacy of the aq receptors, 
their contribution might decrease under such conditions. The saturation 
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Pig. 7. Model predictions illustrating the contribution of »i and a? receptor- 
mediated effects (dotted lines) to the steady-state concentration-response relationship 
(based on mean parameter estimates for [Ca 2+ ]= 0.5 mM). 



of the stimulus-response relationship after NCX-inhibition and the effect 
of external Ca ++ on efficacy are illustrated in Fig. 9 with the steady- 
state occupation-response curves predicted by Eqs. (7) and (19) or Eqs. 
(16-18), respectively, for the (^-mediated effect using mean values of pa- 
rameter estimates. It is important to note, however, that the validity of 
the steady state concentration (or occupation) response curves predicted 
with the parameters estimated in our 1-min infusion experiments is based 
on the assumption that the system is time invariant. Interestingly, the 
ternary complex model that accounts for the kinetics of activation of a 
mediator or intermediate stimulus, where E ( t ) = a^DR^Ai (t) (Model 
III, Eqs. (13-15)) fitted the data as well as the other effectuation models; 
this is not surprising in view of the fact that E (t) data generated by 
this model can be fitted using the static stimulus-response model with 
delay (Model I) (results not shown). 

The present results obtained for digoxin also shed some new light on 
the limitations of the conventional link approach in PK/PD modeling 
in case of slow receptor binding. Theoretically, it is clear that the link 
model only holds for instantaneous receptor binding (e.g., [20]); how- 
ever, the validity of this assumption has hardly been verified in previous 
applications. In contrast to the link model our nonlinear kinetic model 
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Fig. 8. Average steady-state concentration-response curves predicted with the 
parameters estimated in experiments with different external calcium concentrations 
(0.5, 1.5 and 2.5 mM). 




NCX 

inhibition 



NCX 

inhibition 



Fig. 9. Influence of external calcium concentration and NCX inhibition on the 
Q 2 -mediated stimulus-response relationship predicted for control experiments using 
Eqs. (7) and (19) (solid lines) and after NCX inhibition using Eqs. (16-18) (dotted 
lines). 
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implies a dose-dependency of the apparent effect equilibration time r 
(temporal delay between effect and concentration). Thus, when tran- 
scapillary exchange is not rate-limiting, one obtains the step response 
for a one-receptor system (Eq. (4)) 

E s tep(t) = E ss ( l-e-'/W) (21) 

with a dose (concentration)-dependent time constant of the step response 

™" = + (22) 

The results presented here strongly indicate the need to reconsider the 
validity of the empirical link model for digoxin PK/PD in humans [2 1 ,22] 
since slow receptor binding but not distribution from plasma to its my- 
ocardial site of action might account for the delayed response [23]. 



CONCLUSIONS 

As demonstrated for digoxin, the present approach of PK/PD mod- 
eling in the perfused rat heart provides estimates for the proportions of 
receptor subtypes in the mixture of receptor populations as well as the 
affinity constant for each receptor subtype, hi contrast to methods based 
on equilibrium concentration-response curves, this method permits dis- 
crimination between the effects of receptor occupation and effectuation 
(signal transduction) in response generation. NCX inhibition led to a 
delay and saturation in the stimulus-response relationship. Using this 
model, it was possible to gain further quantitative insights concerning 
the role of transport, receptor binding and postreceptor events in deter- 
mining the positive inotropic effect of cardiac glycosides. 
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INTRODUCTION 

Corticosteroids (CS) are widely used anti-inflammatory and immuno- 
suppressive drugs with numerous side effects. Upon binding to the 
glucocorticoid receptor (GR) in target cells, these hormones may di- 
rectly cause rapid effects such as cortisol suppression, as well as de- 
layed effects via gene/protein regulation [1], Genes affected by CS 
include both immunosuppressive genes controlling therapeutic effects 
and metabolic genes producing adverse effects. Better understanding 
of CS pharmacogenomics (PG) will not only provide more insight on 
their mechanisms of action, but also aid in optimization of clinical ther- 
apy. We have studied corticosteroids for more than 25 years. In the 
past decade, selective gene/protein markers have been used to study the 
genomic effects of steroids in vivo, and mechanism-based pharmacoki- 
netic/pharmacodynamic (PK/PD) models have been developed to de- 
scribe the time profiles of these effects [2, 3 ,4, 5]. This report presents the 
highlights of a recent microarray study in which the temporal changes 
of 8000 genes in rat liver were examined after single-dose administra- 
tion of methylprednisolone (MPL). Cluster analysis revealed 6 tempo- 
ral patterns consisting of 197 CS-responsive probe sets representing 143 
genes. Mechanistic PK/PD models were developed to describe each 
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CS-responsive gene. For details the reader is referred to two recent pub- 
lications [6, 7] and the cited references. Our results reveal the marked 
diversity of genes regulated by CS via a limited array of mechanisms. 
The PD models provide quantitation of CS pharmacogenomics and ex- 
emplify the use of mathematical modeling to describe the molecular 
system. 



MODELING OF SELECTIVE GENE/PROTEIN 
MARKERS 

Advances in pharmacodynamic modeling have been steadily growing. 
There are a variety of mathematical models available for handling ex- 
perimental data depending on the diverse mechanisms and rate-limiting 
processes that control drug effects (see review in ref. [8]). Fig. 1 rep- 
resents the array of PD models that served as the bases for our model 
development. 

Understanding of the mechanism of drug action and measurements of 
the major contributing intermediate steps are essential for mechanistic 
modeling. The cellular processes for CS pharmacogenomics are depicted 
in Fig. 2. Unbound CS in blood is moderately lipophilic and freely 
diffuses into the cytoplasm of liver cells. These steroids quickly bind 
to the cytosolic GR, and cause activation of the receptor. This may 
lead to some rapid effects such as cell trafficking that seem not to de- 
pend on genomic mechanisms. The activated steroid-receptor complex 
may further translocate into the nucleus where it can bind as a dimer 
to glucocorticoid responsive elements (GREs) in the target DNA and 
lead to the control of various genomic processes. The CS are known 
to cause homologous down-regulation of their own receptor due to de- 
creased transcription [15]. After the transcriptional control of target 
genes, the steroid-receptor complex in the nucleus may dissociate from 
the GRE, return to the cytosol, and either are degraded or recycled. In 
addition to the direct transcriptional regulation of target genes, CS may 
affect DNA transcription of functional biosignals/transcription factors 
(represented by BS), and cause changes in mRNA and protein concentra- 
tions of these regulators. Genes whose expression are controlled by these 
factors at transcriptional and/or post-transcriptional levels will thus be 
secondarily affected by CS. The steroid-receptor complex may also di- 
rectly or indirectly affect mRNA stability in cytosol. Our laboratory 
has been involved extensively in modeling corticosteroid pharmacody- 
namics and related physiological systems. The majority of rapid effects 
of CS appear to be well described by indirect response models [16-18]. 
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Fig. 1. Some basic PD models. Panels: (a) Direct effect model [9]. (b) Bio- 
phase distribution model [10]. (c) Basic indirect response models [11]. (d) Precursor- 
dependent indirect response models [12]. (e) Signal transduction model [13,14]. C p 
- systemic drug concentration, C c - biophase drug concentration, ku and k e o - first- 
order rates of drug distribution to the biophase, ki„ - zero-order production rate of 
the response marker, k ou t - first-order disposition rate of the response marker, ko - 
zero-order production rate of the endogenous precursor, k, - first-order disposition 
rate of the precursor, k p - first-order production rate of the response marker from its 
precursor, R - free receptor, RC - drug-receptor complex, E n - the nth biosignal in 
the transduction process, r - mean transit time of the transduction process. 



Modeling the genomic effects of CS has been an evolving process. Since 
1986, a series of mechanism-based PK/PD models (Fig. 3) have been 
proposed to describe the gene-mediated effects of steroids in terms of re- 
ceptor down-regulation and enzyme induction in rats [2,4, 19-23]. These 
models evolved over time with the addition of mRNA measurements 
and more extensive studies in terms of dosages used and duration of tis- 
sue sampling. Several basic pharmacodynamic theories were integrated 
in these models including receptor binding, indirect response models, 
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Fig. 2. Schematic representation of diverse molecular mechanisms of CS action 
on gene expression in hepatocytes. GR - glucocorticoid receptor, GRE - glucocorti- 
coid responsive element, BS - functional biosignal (such as transcription factors or 
hormones). 

signal transductions, and tolerance development due to receptor down- 
regulation. The tyrosine aminotransferase (TAT) enzyme in liver was 
selected as the PD marker. The most recent “fifth-generation model” 
(Fig. 3(e)) described the pharmacokinetics (Fig. 4(a)), GR rnRNA 
repression (Fig. 4(c)), receptor dynamics (Fig. 4(d)), TAT rnRNA in- 
duction (Fig. 4(e)), and its enzyme induction (Fig. 4(f)) in liver upon 
acute dosing of MPL in ADX rats [2,23]. Also shown (Fig. 4(b)) is the 
simulated profile of dmg-receptor complexes in the nucleus (DR(N)), 
which can act as the major driving force for genomic effects of CS. 



MODELING OF CORTICOSTEROID 
PHARM ACOGENOMICS : 

THE MICRO ARRAY STUDY 
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Traditional assay techniques such as Northern blots and RT-PCR only 
allow measurement of a single or very small number of messages. In the 
past decade, the development of microarrays enables simultaneous ex- 
amination of thousands of genes and offers the opportunity to study 
a more global picture of gene regulation. Due to the magnitude of 
data obtained by gene array studies, there is a growing awareness of 
the need for development of mathematical models and other bioinfor- 
matic tools that will allow estimation of kinetic parameters that gov- 
ern the biologic processes. Corticosteroid pharmacogenomics were re- 
cently studied using gene microarrays in rat liver [6,7], In brief, methyl- 
prednisolone was administered intravenously at 50 mg/kg to adrenalec- 
tomized rats. Animals were sacrificed and liver excised at 17 time points 
over 72 hours. Four untreated rats were sacrificed at 0 hr as controls. 
RNAs from individual livers were used to query Affymetrix GeneChips© 
(Affymetrix, Inc., Santa Clara, CA), which contain sequences for 8000 
rat genes. Cluster analysis was performed using Affymetrix Microar- 
ray Suite 4.0© (Affymetrix, Inc., Santa Clara, CA) followed by Gene- 
Spring 4.1© (Silicon Genetics, Redwood City, CA). Six temporal pat- 
terns consisting of 197 CS-responsive probes representing 143 genes were 
revealed from the cluster analysis. The whole dataset is available on- 
line at http://microarray.cnmcresearch.org/ (link Programs in Genomic 
Applications). Based on our fifth-generation model of steroid PK/PD, 
mechanistic models were developed to describe the time -pattern for each 
CS-responsive gene [7]. As shown in Fig. 3(e) (inside the grey dotted 
square), the drug kinetics and receptor dynamics were modeled as before 
(Fig. 4(a)-(d)) with the differential equations as follows: 



Cmpl 


= Ci ■ e~ Xl t + C 2 ■ e~ X2 t 


(1) 


dmRNAjt 


/ DR(N) \ 


(2) 


dt 


s ' Rm V ICt0.Rm + DR(N)J 


dR 

dt 


kd-Rm ■ ttiRNAr 

= k s _R • mRNAfi + Rj ■ k re ■ DR(N) - 


(3) 


dDR 

dt 


kon - D R — kd_R ■ R 
= kon • D ■ R — kx • DR 


(4) 
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Fig. 3. Proposed receptor/gene-mediated pharmacodynamic models for CS action. 
Panels: (a). First-generation model [19]. (b). Second-generation model [20]. (c). 
Third-generation model [21]. (d). Fourth-generation model [2]. (e). Fifth-generation 
model [23], 
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Tyrosine Aminotransferase Induction in Rat Liver by 
MPL (Dose = 50 mg/kg) 






Fig. 4. PK/PD modeling results using the fifth-generation model. Adrenalec- 
tomized rats received 50 mg/kg intravenous injection of MPL at 0 hr. Panels: (a) 
Plasma MPL concentration, (b) Simulation drug-receptor complex in hepatocyte nu- 
cleus (DR(N)). (c) GR mRNA level in liver, (d) FYee GR density in liver, (e) TAT 
mRNA level in liver, (f) TAT activity in liver versus time. Solid circles are the 
observed data and bars are the standard deviations. Lines are model predictions. 
Modified from ref. [2,23]. 



dDR(N) 

dt 



kr-DR-kre- DR(N) 



(5) 



where symbols represent the plasma concentration of MPL in ng/ml 
(Cmpl) i the plasma concentration of MPL in nmol/1 ( D ), the receptor 
mRNA (mR N A r), the free cytosolic GR density (R), cytosolic drug- 
receptor complex (DR) and drug-receptor complex in nucleus (DR (N)). 
The Ci and A, are coefficients for the intercepts and slopes of the PK 
profile. The rate constants in the equations include: zero-order rate 
of GR mRNA synthesis ( k a _ft m )\ the first-order rates of GR mRNA 
degradation (kd_R m ), receptor synthesis (k s _ri) and degradation (&<*_#), 
translocation of the drug-receptor complex into thenucleus ( kr ), and the 
overall turnover of DR(N) to cytosol (fcre); as well as the second-order 
rate constant of drug-receptor association ( kon ). In addition, /Cso.fim 
is the concentration of DR(N) at which the synthesis rate of receptor 
mRNA drops to 50% of its baseline value, and Rf is the fraction of free 
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receptor being recycled. The baselines were defined using the following 
equations: 



kd_Rm 

k a .R 

where mRNA° R and R° are 
free cytosolic GR density. 



mRNA° R 



( mRNA °) ' kd - R 



( 6 ) 

(7) 



the baseline values of receptor mRNA and 



Cluster 1 

Cluster 1 contained 23 probe sets representing 21 genes. This cluster 
had patterns similar to TAT with a simple induction with maximum 
at around 5 hr followed by a fast decline to baseline after 18 hr (Fig. 
5). Cluster 1 genes were all well-captured by the PD model assuming 
CS-enhanced transcription in the nucleus via DR{N). Fig. 5 presents 
the model structure, the differential equations, and three representative 
fittings of cluster 1 genes. Table 1 shows the estimated parameters 
for 10 selective probes. The mRNA degradation rate constant kd_ m 
represents the drug-independent property of the physiological system, 
and the linear stimulation factor S represents the drug specific property 
of the message. Detailed results, descriptions, and discussion for this 
cluster and the other 5 clusters can be found in the original paper [7]. 

Cluster 2 

Cluster 2 contained 9 probes coding for 6 genes. This cluster also 
showed a simple induction but with a maximum at around 10 hr followed 
by a slower return to baseline as late as 72 hr (Fig. 6), compared to 
the first cluster. The same model assuming induced transcription by 
DR(N ) was applied to all cluster 2 genes due to the strong similarities 
between cluster 1 and 2 temporal patterns. This implies that genes in 
these two cluster were regulated by CS based on the same mechanism. 
Fig. 6 presents the model structure, the differential equations, and three 
representative fittings of cluster 2 genes. 

Cluster 3 

Cluster 3 contained 21 probes representing 18 genes. This cluster 
showed a more complex pattern with an initial decline followed by de- 
layed increase (Fig. 7), suggesting two mechanisms might be involved 
in CS action. The initial decline was short-lived and was explained by a 
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Ihble 1 . Representative pharmacodynamic parameters for cluster 1 genes. 



GENE ACCESSION 

No. NAME NUMBER 


kd. m (hr *) 
Estimate! CV%) 


S(fmol/mg) 1 
Estimate(CV%) 


#1 Ornithine X07944exon#l-12-s.at 

Decarboxy- 
lase 


0.30(26%) 


0.021(11%) 


#2 Microtubule- U05784-s_at 
Associated 
Proteins 1A 
and IB 


0.38(21%) 


0.012(8%) 


#3 Hyaluronan rc.AI639246.at 
Receptor 
Lyve-1 HTC 


0.31(28%) 


0.015(11%) 


#4 Rat Y-Box D28557.s.at 
Binding 
Protein-a 
(RYB-a) 


0.20(29%) 


0.011(15%) 


#5 Cyclin G X70871.at 


0.43(28%) 


0.016(10%) 


#6 Matrix Gla rc.AI012030.at 
Protein 
(MGP) 


0.28(28%) 


0.020(12%) 


#7 Nucleosome rc.AA859920_at 
Assembly 
Protein 


0.24(26%) 


0.011(13%) 


#8 Mitochondrial D63411-s.at 
Precursor 
Receptor 


0.25(30%) 


0.008(14%) 


#9 Ornithine J04791.s_at 

Decarboxy- 
lase 


0.21(30%) 


0.031(15%) 


#10 Ornithine J04792.at 

Decarboxy- 
lase 


0.27(27%) 


0.053(12%) 



rapid stimulation of mRNA degradation in cytosol by steroid (DR). The 
predominant induction was assumed to be secondary to a CS-enhanced 
transcription factor (BS). The DR(N) enhanced the transcription of a 
BS gene in the nucleus, which translates to higher BS protein levels. 
The increase of BS leads to the delayed enhanced transcription of tar- 
get gene. There is strong evidence in the literature that for at least 
two genes, arginase and carbamyl phosphate synthetase, the delayed 
induction was secondary to the primary CS-enhanced transcription fac- 
tor C/EBP [24]. All cluster 3 genes were best captured by the proposed 
model. The mRNA for BS was assumed to be a cluster 1 gene to limit the 
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Modeling of Cluster 1 



•»Ds 



DR(N) 



♦ mRNA 



dmRNA =k lm (l + S- DR(N))- k a . • mRNA 
dt 

LC. * mRNA* = / 
Baseline : k t m ~kj m - mRNA 9 




Time (hr) 



Fig. 5. The proposed model, the differential equations, and three representative 
fittings of cluster 1 genes. Solid circles are the mean gene array data and bars are the 
standard deviations. Solid lines are fittings with the proposed model for each indi- 
vidual gene. DR(N) - drug-receptor complex in nucleus, mRNA - target mRNA level 
(normalized as ratio to 0 hr control), k,_ m - zero-order rate of the target mRNA syn- 
thesis, kd. m - first-order rate of the target mRNA degradation, S - linear stimulation 
factor, mRNAO - baseline mRNA level (normalized). 



number of estimated parameters. Its degradation rate constant ( kd_BSm ) 
and linear stimulation factor by DR(N ) ( Sssm ) were fixed according to 
the typical parameter values of cluster 1 genes. Fig. 7 presents the model 
structure, the differential equations, and three representative fittings of 
cluster 3 genes. 

Cluster 4 

Cluster 4 contained 66 probes coding for 49 genes. This cluster showed 
a longer down-regulation in the first 8 hours followed by even more sus- 
tained message induction almost throughout the 72 hour time period 
(Fig. 8). The long-lived decrease suggests that instead of rapid in- 
fluence on mRNA stability in cytosol, these genes may be suppressed 
by DR(N) binding to negative GREs on DNA. The secondary induc- 
tion may be caused by a CS-enhanced biosignal similar to cluster 3. 
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Modeling of Cluster 2 



DR(N) 
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k, m 



mRNA 




^ ,R ^ — « *,. (/ + S • DR(N))- k , . mRNA 
dt 

I.C. = mRNA* = / 
Baseline : k t m -k 4 m mRNA* 
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0 12 24 3ft 40 60 72 
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Fig. 6. The proposed model, the differential equations, and three representative 
fittings of cluster 2 genes. Solid circles are the mean gene array data and bars are 
the standard deviations. Solid lines are fittings with the proposed model for each 
individual gene. Symbols are defined as in Fig. 5. 



This biosignal may exhibit a delayed and prolonged message induction, 
similar to cluster 2 genes. About two-thirds of cluster 4 genes were 
best captured by the PD model assuming a repressed transcription by 
DR(N) followed by an enhanced transcription that was initiated by the 
steroid enhanced regulator BS. Figure 8 presents the model structure, 
the differential equations, and three representative fittings of cluster 4 
genes. The hypothetical compartment (BS a ) is the simplified form of 
the primary-response-product ( BS r ) in the cluster 3 model (Fig. 7). 
This simplification allowed illustration of the underlying transduction 
process without assuming the behavior of the CS-enhanced biosignal. 
The additional complexities of other cluster 4 genes were discussed in 
the original paper [7]. 

Cluster 5 

Cluster 5 contained 10 probe sets representing 5 genes. This cluster 
was a unique cluster showing a very fast induction with a peak at 30 
min followed by an abrupt decrease to base li ne after 4 hr (Fig. 9(b)). 
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DR 




Fig. 7. The proposed model, the differential equations, and three representative 
fittings of cluster 3 genes. Solid circles are the mean gene array data and bars are 
the standard deviations. Solid lines are fittings with the proposed model for each 
individual gene. DR drug-receptor complex in cytosol, DR(N) drug-receptor complex 
in nucleus, mRNABS the message level of the intermediate regulator BS (normalized 
as ratio to 0 hr control), BSr the protein level of the BS (normalized), mRNA target 
mRNA level (normalized), kr first-order rate of the drug-receptor complex translo- 
cation into the nucleus, k,.asm - zero-order rate of BS mRNA synthesis, ku.BSm - 
first-order rate of BS mRNA degradation, - first-order rate of BS translation 

to protein, fcd.BS - first-order rate of BS protein degradation, k a _ m - zero-order rate 
of the target mRNA synthesis, kd.m - first-order rate of the target mRNA degrada- 
tion, Sbsjti linear stimulation of fc s _esm by DR(N), S ,„ - linear stimulation of k 3 _ m 
by BSr, S m _d - linear stimulation of k,i_ m by DR, mRN A° BS - baseline BS mRNA 
level (normalized), BSr 0 - baseline BS protein level (normalized), mRNA 0 - baseline 
target mRNA level (normalized). 



These genes were immune -related and were unlikely to be expressed in 
hepatocytes. The fact that they were all sorted to the same cluster 
suggested that they had a common cellular origin and were regulated 
via the same mechanism. Since whole liver was used in our study, it is 
likely that the RNAs for these genes were from blood lymphocytes in 
the sample. The almost immediate message increase was unlikely to be 
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Modeling of Cluster 4 
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Pig. 8. The proposed model, the differential equations, and three representative 
fittings of cluster 4 genes. Solid circles sire the mean gene array data and bars sue 
the standard deviations. Solid lines are fittings with the proposed model for each 
individual gene. BSa - absolute change of regulator BS level from the control, k\ - 
first-order rate of the BSa production from DR(N), S - linear stimulation of fc,_ m by 
BSa, /C 50 .S - concentration of DR(N) at which mRNA synthesis rate drops to 50% 
of its baseline value. Other symbols are defined as in Pig. 5. 



GRE-mediated. Recent data suggest the possibility of rapid membrane- 
initiated genomic effects via specific steroid membrane receptors, and 
the intracellular signaling in lymphocytes [25, 26]. As CS also cause 
an acute effect on lymphocyte trafficking, it is possible that lymphocyte 
efflux from the liver bloodstream could account for the abrupt decrease 
of these immune-related message levels. All cluster 5 genes were best 
modeled combining the fast gene induction effect by DR in lymphoid 
cells and the direct lymphocyte trafficking effect of steroid. The total 
message level in liver sample (mRNA) is represented by the combina- 
tion of mRNA in a single lymphocyte (mRNAC) and total lymphocyte 
counts from the blood source in the sample (CellB) (multiplication). 
Fig. 9 presents the model structure, the differential equations, (a) the 
simulation of the change of blood lymphocyte counts (parameter values 
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from ref. [4]), and (b) three representative fittings of cluster 5 genes. 
After MPL administration, lymphocytes leave the blood stream, yielded 
a nadir count at 4.5 hr, and returned back to normal after 12 hr. This 
drop of lymphocyte number abruptly reduced the total mRNA expres- 
sion to even lower than baseline values for some of the genes around 5 




Tlmt (hr) 



Tlm« (hr) 



Fig. 9. The proposed model, the differential equations, (a) simulated immune 
cell trafficking profile after MPL 50 mg/kg intravenous injection in adrenalectomized 
rats, and (b) three representative fittings of cluster 5 genes. Solid circles are the 
mean gene array data and bars are the standard deviations. Solid lines are fittings 
with the proposed model for each individual gene. D - plasma MPL concentration, 
R - free GR density, DR - drug-receptor complex in cytosol, Celia - the lymphocyte 
counts in blood (normalized as ratio to 0 hr control) mRNAc the message level 
in single lymphocyte (normalized), mRNA - total message level in the liver sample 
(normalized), k on - second-order rate of the drug-receptor binding, - zero-order 

rate of lymphocyte trafficking from the immune tissues to the blood, k ou tjc - first- 
order rate of lymphocyte trafficking from the blood to the immune tissues, k s , m - 
zero-order rate of the mRNA synthesis in single lymphocyte, kd. m - first-order rate of 
the mRNA degradation in single lymphocyte, IC&o-C - MPL concentration at which 
kin.c drops to 50% of its baseline value, S - linear stimulation of k,. m by DR, Cell% 
- baseline lymphocyte counts in blood (normalized), mRNAc - baseline mRNA level 
in single lymphocyte (normalized). 
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hr (Fig. 9(b) #3). Once the cell counts were back to normal but the 
gene induction effect was still not fully diminished, a small second peak 
was observed in some of the genes around 10 hr (Fig. 9(b) #7). 



Modeling of Cluster 6 
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Fig. 10. The proposed model, the differential equations, and three representative 
fittings of cluster 5 genes. Solid circles tire the mean gene array data and bars are 
the standard deviations. Solid lines are fittingB with the proposed model for each 
individual gene. IC50 - concentration of DR(N) at which mRNA synthesis rate drops 
to 50% of its baseline value. Other symbols are defined as in Fig. 5. 



Cluster 6 

Cluster 6 contained 68 probes coding for 44 identifiable genes. About 
eighty percent of cluster 6 genes exhibited simple down-regulation and 
were well-captured by a model assuming repressed transcription by 
DR(N). Fig. 10 presents the model structure, the differential equations, 
and three representative fittings of cluster 6 genes. The additional com- 
plexities of other cluster 4 genes were discussed in the original paper [7]. 
Of special note, several genes such as ornithine decarboxylase (cluster 
1) and hydroxy steroid sulfotransferase (cluster 4) were represented by 
multiple different probes on the gene arrays. Reasonable concordance 
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Table 2. Observed pharmacogenomic effects of CS (from [5,6]). 

Therapeutic Effects Immunosuppression 
Adverse Effects Blood Clotting 

Dyslipidemia 
Carbohydrate Metabolism 
Amino Acid Metabolism 
Xenobiotic Metabolism 



in profiles and dynamic parameters from multiple probes indicated a 
degree of reproducibility in results. 

The observed CS-responsive genes relate to a variety of biological pro- 
cesses (Table 2). In this study, we analyzed genes individually. These 
models serve to provide hypotheses on how mRNA expression is con- 
trolled by direct and secondary factors. Text mining studies may identify 
biomedical literature that are relevant to the individual genes, and more 
importantly, the regulatory/functional relationship between the genes, 
the drug, and the tissue. Such efforts may lead to further integrated 
models incorporating multiple gene inter-regulations that will provide 
additional insights into signaling networks at molecular, cellular, and 
systemic levels. 



SUMMARY 

There are multiple potential sites in the pathway from DNA to pro- 
tein that are available for regulation (Fig. 11) [27], The CS can regu- 
late gene expression at various steps, alone or jointly with other tran- 
scription factors and/or hormones. Our microarray dataset contains 
substantial information about the mechanisms and extent of CS effects 
on various genes. The six major dynamic response patterns suggest 
that a limited number of control processes account for pharmacoge- 
nomic effects of steroids. The generalized mathematical models for re- 
ceptor/gene/transduction dynamics facilitate the understanding of the 
global picture of CS actions and provide new insights for microarray 
data analysis. Pharmacogenomics provides an opportunity to exam- 
ine multiple factors and mechanisms affecting the diverse molecular to 
whole-body actions of drugs. The PK/PD/PG models of the future 
will further integrate pharmacokinetics, molecular biology and systems 
pharmacology. 
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Fig. 11. Steps in the pathway from DNA to protein that a»e available for regula- 
tion. 
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INTRODUCTION 

The work of Teorell (e.g., [1]) provides the earliest comprehensive 
account of the physiological factors that influence the absorption, dis- 
tribution, metabolism and elimination of drugs. While Bellman, Kal- 
aba and Jacquez were the first to introduce the idea of physiologically- 
based pharmacokinetic (PBPK) modeling in 1960 [2], it was Bischoff and 
Dedrick and their collaborators who fully developed and applied these 
ideas to important therapeutic compounds (e.g., [3,4]). Rowland and his 
colleagues have advanced this methodology in part through the incor- 
poration of physiological transport and macro-molecule binding mech- 
anisms, and thereby have further established the importance of PBPK 
modeling in drug development (e.g., [5,6]). Other important contribu- 
tions that have added significantly to the physiological framework for 
preclinical drug development, including the work of Weiss(e.g., [7]) on 
the stochastic theory of drug transport and the modeling approaches 
that incorporate fundamental mechanisms governing drug transport in 
organs that have been developed and applied by Pang to the liver and 
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kidney (e.g., [8]). The pharmacodynamic modeling methods developed 
by Sheiner and Holford (e.g., [9]), Jusko (e.g., [10]) and by Danhoff 
(e.g., [11]) now provide a powerful quantitative framework for studying 
the biological action of drugs in vivo. These and other formal modeling 
approaches codify fundamental PK/PD principles and provide a formal 
basis for translating the knowledge gained in nonclinical drug studies to 
human drug trials. 

While methods for Bayesian inference were first applied in pharma- 
cokinetics to the clinical problem of dose regimen design [12], Bayesian 
estimation has now been used in all phases of drug development. In this 
chapter, we illustrate the application of Bayesian methods to the de- 
velopment of a physiologically-based PK/molecular PD model for an 
anticancer agent. Maximum a posterior (MAP) estimation is used, 
which incorporates parameters for which prior information is available 
(e.g., parameters representing system-specific properties such as organ 
blood Hows, organ volume fractions, receptor density, cellular signal- 
ing protein turnover), as well as parameters with no prior information 
(e.g., parameters representing drug-specific properties such as parti- 
tion coefficients, intrinsic clearance, transport and binding constants). 
While Bayesian methods have been applied previously in PBPK mod- 
eling within a hierarchical population modeling framework (e.g. [13, 
14]), the MAP estimation approach used herein provides a simple-to-use 
method for incorporating informative and non-informative prior distri- 
butions in PBPK and mechanistic PK/PD modeling, and can be im- 
plemented using most nonlinear regression software. This approach 
is applied to develop a whole-body physiologically-based pharmacoki- 
netic model of the ansamycin benzoquinone antibiotic 17-(allylamino)- 
17-demethoxygeldanamycin (17AAG) and its active metabolite 17-(ami- 
no)- 17-demethoxygeldanamycin (17AG) in blood, normal organs (lung, 
brain, heart, spleen, liver, kidney, skeleton muscle) and implanted hu- 
man tumor xenograft in nude mice [15]. 

17-(allylamino)-17-demethoxygeldanamycin (17AAG) is a derivative 
of the benzoquinone ansamycin antibiotic geldanamycin and is currently 
in phase I testing [16]. Geldanamycin is a potent antiproliferative agent 
that produces growth arrest in the G1 phase of most tumor cell lines [17]; 
however, it is also associated with dose-limiting hepatotoxicity. In vitro 
and animal studies have shown that while the antiproliferative activity 
and mechanism of action of 17AAG is similar to that of geldanamycin 
[18-20], it is considerably less hepatotoxic than its precursor [21]. 

Preclinical pharmacology studies of 17AAG, including metabolism 
studies in mouse and human hepatic preparations, reveal several metabo- 
lites including the active CYP3A4 product 17-(amino)-17-demethoxy- 
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geldanamycin (17AG) [22]. Murine pharmacokinetic studies of intra- 
venous 17AAG show that both 17AAG and 17AG are widely distributed 
to tissues where they are present in most tissues at substantial concen- 
trations, with only small amounts of each compound detected in urine 
[23]. Compartmental and non-compartmental analyses of plasma data 
reported in mice indicate linear ki netics over a dose range of 27 to 40 
mg/kg, with a total systemic clearance of 17AAG in the range 54 to 74 
ml/min/kg [23]. 

Extensive in vitro studies indicate that the antiproliferative activity 
and mechanism of action of ansamycin antibiotics, including 17AAG, 
correlate with their ability to deplete oncoproteins such as pl85 erbB2 , 
mutant p53 and Raf-1 [24,25]. The proto-onco-proteins pl85 erbB2 and 
Raf-1 are essential for transmission and amplification of mitogenic sig- 
nals from the cell surface to nucleus in normal cells and both are over- 
expressed in certain tumor cell lines. The specific mechanism of oncopro- 
tein depletion by ansamycin antibiotics is thought to be related to their 
ability to disrupt the molecular chaperone function of heat shock pro- 
tein HSP90 and its homologue GRP94, causing the degradation of their 
client proteins that require the chaperones for maturation and stability 
[25-30], 

The work reported in this chapter builds on the previously reported 
preclinical pharmacology studies of 17AAG, known mechanisms of ac- 
tion of ansamycin antibiotics, and current understanding of pathways 
involved with the auto-regulation of heat shock proteins. The goal of 
the present study was to apply Bayesian estimation methods that allow 
the combination of available prior information, to develop a physiological 
pharmacokinetic and molecular pharmacodynamic modeling of 17AAG 
and 17AG. This was accomplished by: 1) using data from normal mice 
to develop a PBPK model of 17AAG and its primary active metabo- 
lite 17AG; 2) using 17AAG and 17AG concentration measurements in 
the tumors of nude mice bearing human breast cancer xenografts to de- 
velop a model for the uptake and distribution of these compounds in the 
tumor; 3) using measurements of pl85 erbB2 , Raf-1 and heat shock pro- 
teins (HSP70 and HSP90) in the tumor xenografts to develop molecular 
pharmacodynamic models relating the cellular concentrations of 17AAG 
and 17AG to the targeted onco-proteins and heat shock proteins based 
on the known molecular mechanisms of drug action. The work culmi- 
nates in a complete PBPK/PD model for the investigational anticancer 
agent 17AAG and its metabolite 17AG, and provides insights into the 
molecular mechanism of action of these compounds in vivo. 
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METHODS 
Bayesian Estimation 

The PK/PD of the drug under study can be modeled as a continuous 
dynamic system with discrete time measurements of the drug’s kinetic 
and/or dynamic responses, as given below: 

= f(x(t),a,r(t),t), x(0) = c 

z(tj ) = j = (1) 

where the state vector x(t) (nxl) can represent compartment amounts, 
tissue concentrations, cell concentrations, effect site concentrations, phys- 
iological variables, ligand concentrations, signaling pathway proteins, 
etc. In Eq. (1), the vector r(t ) (A; x 1) represents drug infusion regimens 
(allowing for administration and modeling of multiple drugs) as well as 
any patient-specific covariates (e.g., clinical variables affecting PK/PD 
processes). In addition, drugs can be administered as bolus inputs, rep- 
resented by the vector b(t) (nxl), that are assumed to change states 
instantaneously (i.e., x(dtj) = x(dt~) + b(dti), i = 1,...,). The vector 
a (p x 1) represents the collection of all model parameters (rate con- 
stants, distribution volumes, partition coefficients, effect site clearances, 
maximum responses, biomarker production and turn over rates, etc.), 
and is random with known probability density function p(a). Mea- 
surements of the drug’s kinetic and/or dynamic responses are repre- 
sented by the / x 1 vectors z(tj),j = 1, . . . , m. It is assumed that these 
measurements are subject to random errors that are independent with 
zero mean. For notational convenience the vector z (ml x 1) is used 
to represent the measurements ( z — [z(t\) T , . . . ,z(t m ) T ] T ), e (ml x 1) 
the errors (e = [e(ti) T , . . . ,e(t m ) T ] T ), and y(a,t ) the model outputs 
h(x(t),a,r(t),t). It is further assumed that Var(e{(tj )) = gi(y(ct,tj)), 
j = 1, . . . , m, * = 1, . . . , /, where the functions gi are known. Given this 
pharmacokinetic/pharmacodynamic model formulation, Bayes’ theorem 
is applied to estimate the posterior distribution of the model parame- 
ters p(a.\z) given the measurements and the prior distribution: p(a\z) = 
l(ot\z)p(a) / f l(a\z)p(a)da. The l ik elihood function l(a\z) is readily ob- 
tained given the model definition above and an assumption about the 
error distribution. 

In the work reported herein MAP estimation has been used, incorpo- 
rating parameters for which prior information (mean and covariance) is 
available as well as parameters with no prior information (i.e., parame- 
ters with non-informative priors). For example, for a perfusion-limited 
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organ model, prior information is available for organ blood flow while 
no prior information is available for the 17AAG or 17AG partition co- 
efficients. In general, if a represents the vector of all parameters to be 
estimated, and Qi is the subset of parameters with known distribution 
Pi (ai) and a 2 is an independent sub-set of parameters with distribution 
P2 ( 02)1 then the MAP estimate of a is: 

<*map = argmax{ln/(ai,a 2 |2) + lnpi(Q!i) + lnp2(a2)} (2) 

where z is the vector of measured data and l (c*i, a 2 |2) is the likelihood 
function. If p 2 ( 0 : 2 ) is non-informative, then: 

olmap = argmax{ln/(ai,a 2 | 2 )-l-lnpi(ai)} (3) 

Equation (3) can be solved readily when the lik elihood is normally dis- 
tributed and p\{ot\) is either normally or log-normally (used herein) 
distributed. 

In the modeling steps described below, model parameters were esti- 
mated using either Maximum L ikelihood estimation (ML) when no prior 
information was available for any of the model parameters, or MAP es- 
timation when prior information was available for at least one of the 
model parameters. Both ML and MAP estimation were performed us- 
ing the ADAPT pharmacokinetic/pharmacodynamic systems analysis 
software [31]. Model selection was guided using the Akiake information 
criterion (AIC) for ML estimation and generalized information criterion 
(GEN-IC) for MAP estimation (see [31]). 

Animal Experiments 

In experiments on non-tumor-bearing mice, previously reported in 
[23], 48 adult female CD2L1 mice were divided into 16 time groups (one 
group as vehicle control) for terminal sampling of plasma, red blood cells 
(RBC), lung, brain, heart, spleen, liver, kidney and skeleton muscle at 5, 
10, 15, 30, 45 min and 1, 1.5, 2, 3, 4, 6, 7, 16, 24, 48 hr following dmg ad- 
ministration. In a subsequent set of experiments on tumor-bearing mice, 
16 NCR SCID adult female mice (weighing 17.3 - 21.0 gm) with MDA- 
MB-453 human breast cancer xenografts in their flank were divided into 
8 time groups (one group as vehicle control) for terminal sampling of 
plasma, lung, heart, spleen, liver, kidney and tumor at 2, 4, 7, 16, 24, 48 
and 72 hr following drug administration. In both experiments, 17AAG 
was injected as an intravenous bolus via lateral tail vein and at a dose 
of 40 mg/kg fasted body weight. 

Plasma and tissue concentrations of 17AAG and 17AG were deter- 
mined by HPLC (see [23] for details). The concentrations of Raf-1, 
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pl85 erbB2 , HSP90 and HSP70 in tumor tissues were determined by west- 
ern blot analysis. The western blot band density readings were first nor- 
malized to actin and then referenced to the average normalized reading 
of the controls. 

The concentration profiles of 17AAG and 17AG in venous RBC and 
plasma indicated that a rapid equilibrium was achieved between the 
two fractions in blood for both compounds. Accordingly, the RBC and 
plasma concentrations of 17AAG and 17AG from the non-tumor ex- 
periments were used to calculate the RBC-plasma partition coefficient 
(R = CRBc/Cpiasma ) for each compound: R aag =2A7±1.23 (mean±SD, 
n = 30) and R AG = 5.86 ± 2.88 (n = 30). The unbound fractions of 
17AAG and 17AG in plasma (fu p = C p i a sma,free/C p i a3 ma ) were mea- 
sured in a separate in vitro study with the following results: fu PtAAG = 
0.063±0.015 (n = 10) and fu p>AG = 0.081 ±0.042 (n = 10) [23]. The un- 
bound free fractions in blood were then calculate to be f u b,AAG = 0.0209 
and fub'AG = 0.0140, assuming a hematocrit of 0.45. 



PHYSIOLOGICALLY-BASED 
PHARMACOKINETIC MODELING 

Non-Tumor Bearing Mice 

The PBPK model for 17AAG and 17AG in non-tumor-bearing mice 
was constructed through the following six steps: 1) fit sums of expo- 
nentials to the venous blood data of 17AAG and 17AG separately; 2) 
simultaneously model the distribution of 17AAG and 17AG in the lung 
using the fitted venous exponential functions as input, and predict the 
arterial concentration-time profile for each compound; 3) in each non- 
eliminating organ, simultaneously model the distribution of 17AAG and 
17AG using the predicted arterial concentration-time profiles as input; 
4) simultaneously model the liver distribution and metabolism of both 
compounds; 5) construct a whole-body model from the foregoing results 
and model the distribution in the unsampled tissues (Misc.); and 6) 
simulate the resulting whole-body model to predict the concentration 
profiles of 17AAG and 17AG in blood and all tissues. 

In modeling the individual organs, both perfusion-limited and diffusion- 
limited models were evaluated to describe the distribution of 17AAG 
or 17AG in each organ. Different combinations of the perfusion- and 
diffusion-limited models for the distribution of 17AAG and 17AG in the 
same organ were evaluated by fitting simultaneously to the measure- 
ments of both compounds. The models of 17AAG and 17AG in the 
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same organ shared the same set of anatomical and physiological param- 
eters. 

The liver is the only eliminating organ of significance for 17AAG and 
17AG [23]. No prior information is available on the intrinsic clearances of 
17AAG (< CLintAAG ) and 17AG ( CLintAG ) in the liver, or on the fraction 
of CLintAAG responsible for the formation of 17AG ( fm ). Furthermore, 
these parameters cannot be uniquely determined via individual organ 
model-based estimation using solely the measured liver concentration 
data. To solve this problem of unidentifiability of hepatic parameters, 
the following equation relating the systemic clearance to the intrinsic 
clearance of 17AAG in the liver was derived: 



CLsaag = 



QVv fUb,AAGkv,EV,AAG 

<3(1 + ffVgV - ) + Vv fnb,AAG^V,EV,AAG 



(4) 



where Q, Vv, ky t EV and ksvy are parameters for the diffusion-limited 
model used to describe the distribution of 17AAG in the liver (see be- 
low). The systemic clearance of 17AAG ( CLsaag ) was calculated as 
dose divided by the area under the predicted arterial concentration-time 
profile of 17AAG. A similar equation for the metabolite 17AG leads to 
the following equation for the case of a perfusion-limited model: 



AUC(C Ar t,AG) 



f m C LintAAGAU C ( Cev,aag ) 
CLintAGRAG 



(5) 



The area under the arterial concentration-time curve of 17AG ( AUC - 
( Cah,ag )) was calculated from the predicted arterial concentration pro- 
file of 17AG. With these two additional relationships, the drug-specific 
kinetic parameters of liver can be uniquely estimated for the composite 
liver model using only the measured liver concentration data. 



Tumor-Bearing Mice 

A whole-body model for the distribution of 17AAG and 17AG in 
tumor-bearing mice was constmcted by adding a parallel vascular bed 
representing the tumor to the whole-body model developed for the non- 
tumor-bearing mice. All kinetic and physiological parameters for normal 
organs were fixed at the values determined from the non-tumor model, 
and the anatomical parameters were replaced by those measured in the 
tumor animal experiments. It was assumed in this approach that the 
animal physiology and drug transport ki netics in tumor-bearing mice 
were the same as those in non-tumor-bearing mice. 

A number of different diffusion-limited models were evaluated to de- 
scribe the tumor distribution of 17AAG and 17AG, including a model 
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Fig. 1. Tumor Model. The subscripts V, IT and C represent the vascular space, 
interstitial space and cellular space; Psv.it represents the permeability surface-area 
product of the vascular wall; kiT.c and kc.iT are the transport rate constants across 
the cell membrane. 

with a vascular and an extra-vascular space, and a model with an extra- 
cellular and cellular space, as well as several models containing vascular, 
interstitial and cellular spaces. One of the tested models is shown in Fig. 
1, with Eqs. (6-9) describing the drug concentration (17AAG or 17AG) 
in each of the tumor spaces and the measured tissue concentration. 



Q (Cah ~ Cy) - PSv,it ( Cy — Cjt ) (6) 

PSv,it{Cv ~ Cit ) - kiT,cCirVir + kcjrCcVc (7) 

kiT,cCirViT - kcjrCcVc (8) 

CitVit + CqVc 

ViT + Vc W 

After incorporating the candidate tumor models into the whole-body 
model for the two compounds, the parameters of the tumor models were 
estimated by fitting the model predictions to the measured tumor tissue 
concentrations of 17AAG and 17AG simultaneously. 

Results: Whole-Body Model 

The composite whole-body model for 17AAG-17AG in tumor-bearing 
mice is depicted in Fig. 2. The individual organ models for the up- 
take of 17AAG incorporate diffusion-limited exchange for all organs, 
while perfusion-limited models describe 17AG uptake in all organs. Es- 
timates of the model parameters (with the exception of liver and tumor 
which are presented below) are given in Table 1 (physiological param- 
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eters) and Table 2 (drug- specific kinetic parameters). Table 2 lists the 
vascular-to-extra- vascular clearance (CLy,EV — ky,EVvV fub) and the 
extra-vascular-to-vascular clearance (CLev,v = kEV, VVev) for each 
diffusion-limited organ as well as the partition coefficient value for each 
perfusion-limited organ. Recall that while the protein binding of 17AAG 
and 17AG was measured in the blood and incorporated in the model, 
it was assumed that all 17AAG and 17AG in tissue are available for 
extra-vascular-to-vascular transport. The predicted tissue concentra- 
tions from the whole-body model for non-tumor-bearing mice and the 
measured tissue concentrations for 17AAG and 17AG are shown in Figs. 
3 and 4. 



Table 1. Individual Organ Physiological Parameters-Model 
Estimates and Literature Values. 



Organ 


Organ Blood Flow Q 
(as % cardiac output) 


Vascular Volume V v 
(as % of organ volume) 


Estimate 


Population 
Mean ± SD 


Estimate 


Population 
Mean ± SD 


Lung 


- 


100 


40.5 


50.0 ±11.0 


Brain 


2.91 


3.35 ± 1.04 


2.2 


3.0 ± 1.2 


Heart 


6.94 


6.72 ± 2.66 


2.8 


4.0 ± 1.6 


Spleen 


0.28 


0.42 ± 0.32 


17.1 


17.0 ± 1.0 


Kidney 


11.40 


11.10 ± 0.05 


15.3 


24.0 ±11.0 


Muscle 


31.09 


17.98 ± 8.85 


3.2 


4.0 ± 1.4 


Misc 


28.74 


- 


4.1 


4.0 ± 1.6 



Table 2. Individual Organ Model Kinetic Parameter Estimates. 



Organ 


17AAG 


17AG 


| CLv.ev 


CLev,v 


kv.EV 


kEV.V 


R 


(ml/hr) 


(hr~ 


') 


Lung 


12.10 


3.67 


5452 


23.47 


2.24 


Brain 


2.12 


21.32 


11640 


54.43 


0.03 


Heart 


3.44 


2.73 


46500 


22.54 


0.76 


Spleen 


0.14 


0.10 


554.2 


1.60 


1.19 


Kidney 


3.08 


1.07 


3369 


4.39 


2.07 


Muscle 


22.41 


26.78 


5067 


4.15 


0.27 


Misc. 


19.35 


15.30 


3537 


2.49 


0.91 



The model selected to describe the distribution, metabolism and se- 
cretion of 17AAG and 17AG in liver is depicted in Fig. 5 and is described 
by the following equations: 
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Fig. 2. Composite whole-body model for 17AAG and 17AG in tumor-bearing mice. 
The acronyms for the organs used in equations are: LU: lung, BR: brain, HT: heart, 
SP: spleen, LI: liver, KI: kidney, MU: muscle, Art: arterial blood and Ven: venous 
blood. 
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Fig. 3. Whole-body model simulation results for 17AAG. 



rr dCvAAG 

V V "77 



Vev 



dCEV,AAG 

dt 



v dC AG 

dt 



( Qu - Qsp)CATt,AAG + QspCsp,Ven,AAG 
~QuCv,AAG ~ ky,EV fUb,AAGCv,AAGVv (10) 
kv,EvfUb,AAGCv,AAGVv 
-CLint AA GCEV,AAG (11) 

(Qu ~ Qsp)CArt,AG + QspCsP,Ven,AG 
~ CLintAGCAG 

Rag 

+fmCLintAAGCEV,AAG ( 12 ) 



The estimated model parameters are listed in Table 3 and Fig. 6 shows 
the liver concentration-time profiles of 17AAG and 17AG predicted by 
the estimated whole-body non-tumor mouse model. 

The uptake of 17AAG in liver tissue was found to be diffusion-limited 
with no significant transport from the extra-vascular space to the vas- 
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Fig. 4. Whole-body model simulation results for 17AG. 



Table 3. Individual Organ Model Ki- 
netic Parameter Estimates. 



Parameter 


Unit 


17AAG 


17AG 


kv,BV 


hr- 1 


6132 


- 


CLv,bv 


ml/hr 


27.45 


- 


R 


- 




4.87 


CLint 


ml/hr 


4.93 


3.34 


fm 




0.40 


- 


Q a 


% CO 


18.92 




v b 

Vy 


% VLI 


23.9 





a. Literature mean ± SD = 16.10 ± 2.45; 

b. Literature mean ± SD = 31.0 ± 7.0. 



cular space ( kEvy — 0). This result is supported by in vitro studies of 
17AAG metabolism in mice and human hepatic preparations, in which 
the metabolic activity was found to reside predominantly in microsomes 
with little or no metabolism of 17AAG in liver cytosol [22], Our model 
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Fig. 6 . Liver model diagram of 17AAG and 17AG. 




Fig. 8 . Whole-body model simulation result of 17AAG and 17AG concentrations 
in liver. 

suggests that upon entering hepatocytes, 17AAG is either trapped in 
microsomes where it is metabolized or secreted into bile. Interestingly, 
with only the difference of a propenyl group (-CH2-CH=CH2) on the 
side chain of the ansamycin ring, 17AG produced in microsomes can 
be released from microsomes and transported out of hepatocytes into 
blood circulation. 17AG can be secreted into bile but cannot be further 
metabolized [22]. A perfusion-limited model was found to adequately 
describe the distribution of 17AG in the liver. The hepatic metabolism 
of 17AAG is complex, with metabolites, including 17AG, found in the 
plasma and bile of mice treated with 17AAG [22,23]. However, neither 
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Fig. 7. Tumor model results. Model predictions (solid lines) and measurements 
(symbols) of 17AAG (left) and 17AG (right) concentrations in tumor (interstitial and 
cellular space). The dashed lines are the model predictions of the concentrations of 
17AAG and 17AG in tumor cells. 

the identity of the metabolites nor the metabolic or secretory pathways 
have been fully characterized. By simultaneously modeling the distri- 
bution and elimination of 17AAG and 17AG in liver, we were able to 
estimate that 40% of the total intrinsic clearance of 17AAG involves the 
formation of its metabolite 17AG. 

Results: Tumor Model 

Of the five different models postulated to describe the uptake of 
17AAG and 17AG in the human breast tumor xenografts, the model 
shown in Fig. 1 was selected for both 17AAG and 17AG. The result- 
ing estimates of the vascular-interstitial permeability-surface area prod- 
uct {PSvjt) as well as the interstitial-cellular exchange rate constants 
(k/T,c and kcjr) are listed in Table 4 for both 17AAG and 17AG. 
Table V lists the model estimates of tumor vascular and interstitial vol- 
umes. Because the distribution of both 17AAG and 17AG in tumor was 
diffusion-limited, there was no information in the data on tumor blood 
flow. Tumor blood flow thus was fixed at the population mean value 
and not estimated as the other physiological parameters. The 17AAG 
and 17AG tumor tissue and cell concentrations predicted from the com- 
posite whole -body model with tumor are shown in Fig. 7 together with 
the tumor tissue measurements. 



MOLECULAR PHARMACODYNAMIC 
MODELING 

The concentration profiles of 17AAG and 17AG in the tumor cells 
predicated from the whole -body PBPK model with tumor were used to 
model the action of these compounds on the tumor cell onco-proteins 
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(pl85 erbB2 and Raf-1) and heat shock proteins (HSP70 and HSP90). 
These ansamycins were shown to be equally potent in depleting Raf-1 
[19], and were assumed to also exhibit equal potency in their actions on 
pl85 erbB2 , HSP70 and HSP90. 



Onco-Protein Responses 

The role of HSP90 in chaperoning Raf-1 is summarized in Fig. 8. Af- 
ter HSP90 binds to newly synthesized Raf-1, the HSP90-Raf-l complex 
is transported to the cell membrane where it is available for activation by 
Ras. Following activation, the HSP90-Raf-l complex binds to unknown 
cyto-skeletal elements, activates down-stream kineses and promotes tu- 
mor growth [29]. Ansamycins, including 17AAG, are thought to act by 
inducing the degradation of the HSP90-Raf-l complex at each of its three 
sites equally, and by inhibiting transport of the complex as shown in Fig. 
8 [29]. Since the net action of ansamycins is to enhance the proteaso- 
mal degradation of Raf-1 and since the cellular Raf-1 measured in our 
experiments is the sum of the protein at the three sites, the simplified 
indirect response model shown in Fig. 9 and Eq. (10) were postulated 
to explain the response of Raf-1 to 17AAG and 17 AG. A similar model 
was used for the onco-protein p!85 erbB2 . 



d °-R 

~di~ Hsyn 



kdg,bi\ T 



E max (AAG + AG) 
EC bQ + {AAG + AG) ] 



(13) 



Heat Shock Protein Responses 

The molecular mechanism of heat shock protein auto-regulation and 
the responses of HSP70 and HSP90 following exposure to 17AAG and 
17AG are depicted in Fig. 10. The transcription of HSP genes and 
the resulting synthesis of HSPs are enhanced upon exposure of cells to 
proteotoxic stress such as the presence of ansamycin antibiotics. This 
transcriptional enhancement is due to the activation of heat shock tran- 
scription factor 1 (HSF1), which is capable of specifically binding to the 
HSP genes’ promoter sequence and thereby enhancing the transcription 
of HSP genes. While the synthesis of HSPs is up regulated by active 
HSF1, the transcriptional activity of HSF1 is down regulated by direct 
binding of HSP70, HSP90 and other co-factors [32-35]. We have hypoth- 
esized that this auto-regulatory system is responsible for the heat shock 
response measured in the mouse tumor xenograft following 17AAG ad- 
ministration. Based on the signaling pathway shown in Fig. 10, the 
feedback control model described in Fig. 1 1 was constructed for which 
the following equations can be written: 
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Cell 

Membrane 



=i inhibit ^ HSP90 

* degradation V/\/\ Raf-1 protein 




cyto-skeieton 



Fig. 8, Molecular mechanism of Raf-1 depletion by ansamycin antibiotics. 1: 
The newly synthesized Raf-1 protein complexes with HSP90; 2: The Raf-l-HSP90 
complex in cytosol is transported to the cell membrane; 3: HSP90 remains bound 
to Raf-1 when Raf-1 is recruited and activated by Ras; 4: HSP90 remains bound to 
Raf-1 when Raf-1 binds to unknown cyto-skeletal elements and activates down-stream 
kinases. Ansamycins destabilize the cytosolic, Ras-associated and skeleton-associated 
Raf-1. The disappearances of Raf-1 from the 3 fractions are equal. Raf-1 synthesis 
in ansamycin treated cells is elevated (approximately 3-fold) and ansamycins inhibit 
the transportation of Raf-l-HSP90 complex. 




Fig. 9. Indirect response model for Raf-1 and erbB2. O represents the onco- 
proteins Raf-1 and pl85 crbB2 with the associated parameters unique to each onco- 
protein; represents the zero-order synthesis rate of the protein; ka g ,b represents the 
basal first-order degradation rate of the protein; ( A AG+AG) is the total concentration 
of 17AAG and 17AG in tumor cells. 



dHSP70 P1 

dt 

dHSP70pj 

dt 

dHSPlQ 

dt 



R™ n HSF 1 - k 7Q HSP70 Pl (14) 

k7oHSP70p( i _ 1 ) — kjoH SP70pi i = 2, . . . ,n(15) 



k 70 HSP70 Pn - khg b (l 

eZMag + ag) 



+ 



EC™ + (AAG + AG) 



(16) 
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dHSF 1 
dt 



Fact ~ kteactAiHSPW + HSP90)HSFl (17) 



A time-dependent transduction system [36] was adopted to model 
the time delay in HSP70 gene expression (Eqs. (11,12)). The initial 
values of HSP70pi ( i — l,...,n) were assumed to be equal and were 
estimated. The initial values of HSP70 and HSF1 were set at one. The 
differential equations for HSP90 follow the same format as those for 
HSP70 (Eqs. (11-14)). The activation function of HSF1, in Eq. 
(14), is depicted in detail in Fig. 11. It is a function of HSP70 and 
involves several parameters including the basal activation rate (Ract,b) 
and the amplification factor (A). The function F ac t increases as HSP70 
decreases below its basel in e value (HSP70&). When HSP70 exceeds this 
baseline, the rate of activation of HSF1 remains at its basal level (Ract,b)- 
The rate of de-activation of HSF1 is a function of the HSP70 and HSP90 
concentrations. The amplification factor A was assumed to be the same 
for activation and de-activation. 

Results: The Onco-Proteins Raf-1 and pl85 erbB2 

The model predictions and the measured values of Raf-1 and pl85 erbS2 
in the tumor are shown in Fig. 12. Fong-term model simulation predicts 
that the level of Raf-1 in tumor returns to its base value at approximately 
200 hour, while plSS ^ 82 returns to its pre-drug value at about 120 hours 
(graphs not shown). Estimated model parameters are listed in Table 4 
and compared with corresponding in vitro literature values. The half- 
lives of Raf-1 and pl85 erbB2 and EC50 of Raf-1 estimated from the model 
compare favorably to the values reported from in vitro experiments. The 
degradation rate of plSS ^ 82 was estimated to be linearly related to 
the concentration of 17AAG and 17AG, so no EC 50 was estimated for 
pl85 erbB2 depletion. 

Results: Heat Shock Proteins - HSP70 and HSP90 

The measurements and model predictions of HSP70 and HSP90 in 
tumor are shown in Fig. 13. Fong-term model simulation predicts that 
the level of HSP70 in tumor returns to its base lin e value at approximately 
200 hour, while HSP90 returns to its pre-drug value at about around 300 
hours (graphs not shown). Table 5 lists the resulting estimates of the 
heat shock protein model parameters. There is no report in the literature 
from either in vivo or in vitro studies that provide a basis for comparing 
the results from our analysis. 
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17AAG + 17AG 




V V - 

Fig. 10. The auto-regulation of HSP70 and HSP90 through HSF1 - a gene- 
transcription-mediated feedback control system. HSF1 exists in the control state 
as inert monomers. Upon stress (such as exposure to 17AAG and 17AG), the concen- 
tration of HSP70 is reduced. HSF1 loses contact with HSP70, and trimerizes. The 
transcriptionally active trimers of HSF1 bind to heat shock protein genes and lead to 
the production of HSP70 and HSP90 proteins. The transcriptional activity of HSF1 
is repressed by direct binding of HSP70 and HSP90 to the HSF1 trimer, and HSF1 
eventually returns to its inert monomer state. 



Table 4. Onco-Protein Models Parameter Estimates and Literature Values. 



Parameter 


Unit 


Raf-1 


pl85 crbB2 


Model 

Estimation 


Literature 

Values 


Model 

Estimation 


Literature 

Values 


l l/2 


hr 


22.61 


11 , 17.5 


8.63 


7, 9.5 


EC50 


nM 


3.47 


3.4 


- 


- 


Rsyn 




0.03 


- 


0.08 




Bmax 


/hr | 


4.30 


- 


0.24 6 


- 



° Half-life of endogenous protein turnover (<j /2 = ln2/kd g ,b from Fig. 9); 
b For pl85 erbB2 , a linear model relating drug induced degradation to total drug 
concentration was used in the place of the E m OI model. In the table, Emax 
represents this scale factor. 




123 




Fig. 11. Model diagram of HSP auto-regulation upon treatment of 17AAG and 
17AG. The HSP70 and HSP90 represent the normalized concentration of the two heat 
shock proteins; R S yn is the basal zero-order synthesis rate and kd g , b is the basal first- 
order degradation rate; HSPpi - HSPp n are the transit compartments representing 
the protein synthesis delay for HSP70 and HSP90; k 7a and k go are the transit rate 
constants where the total delay time r = n/k (21); F act is the activation function of 
HSF1 transcriptional activity, in which Ra C t,b is the basal and zero-order activation 
rates, and HSP70b is the HSP70 concentration at control condition; kdeact is the 
basal first-order de-activation rate of HSF1 transcriptional activity; A is the scalar 
representing the amplification effect of gene activation and deactivation. 




Time (hours) Time (hours) 

Fig. 12. Onco-protein model predictions and measurements. 

DISCUSSION AND CONCLUSION 

Distribution of 17AAG and 17AG in Individual Organs 

The model and the data agree reasonably well for 17AAG and 17AG 
in venous blood and all organs, except for 17AG in brain and lung. 
The measured values of 17AG concentration in brain tissue fall mostly 
on the edge of the lower limit of quantification, and the measurements 
are associated with large standard deviation. While there is no report 
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Fig. 13. Heat shock protein model predictions and measurements. 



Table 5. Heat-Shock Protein Model Parameter Estimates. 



Parameter 


Unit 


HSP70 

Model Estimation 


HSP90 

Model Estimation 


b/2 


hr 


4.51 


40.43 


ECso 


nM 


0.90 


38.68 


Rsyn 


hr_i 


0.15 


0.02 


Emcix 


- 


1.64 


31.88 


T 


hr 


13.61 


0 


Eact,b 


- 


0.08 


0.08 


kdeact 


hr_i 


0.04 


0.04 


A 


- 


331.1 


331.1 



a. Half-life of endogenous protein turnover (fi/ 2 = ln2/kd g ,b from Fig. 11) 



on whether 17AG can cross the blood-brain-barrier, the data and our 
model suggest there is little uptake in brain tissue. It is also likely 
that the measurements of 17AG in brain included drug in the residual 
blood remaining in the tissue sample. The lung 17AG data has a peak 
value at 5 min while the peak concentration of 17AG in venous blood is 
not reached until 30 min, which resulted in a poor fit regardless of the 
models tested. We could not find any physiological evidence to explain 
the lung 17AG data. However, the particular models used to describe 
17AG distribution in brain and lung had little effect on the whole -body 
model predictions, including those in the tumor. 

The Bayesian estimates of all the physiological parameters are reason- 
able given the prior means, except for muscle blood flow. The relatively 
large variability in muscle concentration data, especially that of 17AG, 
may contribute to this discrepancy. 
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Estimation of Intrinsic Clearances and 17AAG 
Metabolism in the Liver 

The intrinsic clearance of 17AAG ( CLintAAG )> the fraction of 
( CLintAAG ) associated with the formation of 17AG (fin), and the in- 
trinsic clearance of 17AG ( CLintAG ) in the liver are especially relevant 
to understanding the distribution, metabolism and hepatic toxicity of 
17AAG. We have established a method to uniquely estimate these pa- 
rameters at the stage of individual organ analysis in the model devel- 
oping process (i.e. using only the liver measurements and the arterial 
concentration profiles). The relationship between intrinsic clearance and 
systemic clearance of 17AAG was derived from a simplified whole-body 
model by solving the model equations in Laplace domain assuming lin- 
ear ki netics and only hepatic elimination. We have proved that the 
CLintAAG- CLsaag relationship depends only on the liver model (work 
not shown), when the remaining systemic organs are lumped into a sin- 
gle composite tissue. The resulting equation relating CLintAAG and 
CLsaag was then included in the liver model as an the additional con- 
straint, thus allowing CLintAAG to be uniquely estimated as part of the 
individual liver modeling procedure. An equation relating CLintAG and 
fin to the systemic distribution of 17AG was also derived in a similar 
fashion to uniquely estimate CLintAG and fin. 

The approach we developed for estimating liver clearance in PBPK 
models has the following advantages: 1) it allows intrinsic clearance and 
its related parameters (such as fin ) to be estimated without constructing 
the whole-body model; 2) it makes it possible to model the un- sampled 
tissues, when significant drug uptake is accounted in those unsampled 
tissues. 

Although the liver model describes the data reasonably well, there 
was a consistent over-prediction of 17AAG in the early time period from 
0-15 min. The highest mean measured concentration of 17AAG in liver 
tissue was 89 /ug/ml at 5 min, while the model prediction at 5 min is 
215 /ig/ml. A number of factors might contribute to this discrepancy. 
Firstly, roughly 75% of the blood entering the liver is the venous effluent 
from the small intestine, stomach, pancreas and spleen, while the model 
assumes no uptake of 17AAG in the small intestine, stomach or pancreas. 
The uptake of 17AAG to those tissues may have a significant effect on the 
drug’s distribution. Secondly, in our investigation of different liver model 
structures, we found that a model consisting of three sub-organ spaces 
(vascular, interstitial and cellular) did not reduce the over-prediction. 
The systemic clearance, defined by the 17AAG data of venous blood 
and other organs, indicated a high transport rate of 17AAG from blood 
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to liver tissue, while the liver 17AAG measurements, especially at early 
times, are consistent with a lower transport rate independent of the 
liver model structure. This observation suggests a discrepancy between 
the information imbedded in liver measurements and that in venous and 
other organ measurements. It is possible that the continuous metabolism 
of 17AAG in liver tissue during the time of animal and organ processing 
contributes to this discrepancy. And lastly, the assumption of a constant 
fin may contribute to the difference between measured and predicted 
17AAG liver concentrations as well. 

Tumor Xenograft Uptake of 17AAG and 17 AG 

One of the attractive properties of geldanamycin derivatives as anti- 
cancer agents is their considerably longer half-lives in tumor compared to 
normal organs/tissues. We were able to describe this property by a three- 
component model that includes vascular, interstitial fluid and cellular 
spaces of the tumor. The model revealed that sustained concentration- 
time profiles of 17AAG and 17AG in tumor tissue was due to a relatively 
slow diffusion across the cell membrane, while in normal organs this pro- 
cess was estimated to be essentially instantaneous. The concentrations 
of 17AAG and 17AG in tumor cells themselves were estimated to be 2.3 
times higher than those in the tumor tissue (in extra-vascular space), 
showing the preferential uptake of both compounds in tumor cells. 

It was found from the data that although the concentration of 17AG in 
blood was generally less than that of 17AAG, the concentration of 17AG 
in tumor was significantly higher (1.5 to 3 fold) than that of 17AAG after 
2 hours following dosing. In the tumor models for 17AAG and 17AG, 
the permeability surface-area product (PS) of the vascular wall was 
estimated to be similar for 17AAG ( 0.23 ml/hr) and 17AG ( 0.26 ml/hr), 
and the transport rate constants of the two compounds from cellular 
to interstitial space (kc,/r) were also estimated to be similar (0.062 
hr -1 for 17AAG and 0.057 hr -1 for 17AG). However, the transport rate 
constant of 17AG from interstitial space to the cellular space (fc/r.c) was 
estimated to be significantly higher than that of 17AAG (0.57 hr -1 for 
17AG versus 0.19 hr -1 for 17AAG). The model estimates suggest that 
the preferential uptake of 17AG over 17AAG in tumor tissue is due to a 
more effective uptake mechanism of 17AG in the tumor cells rather than 
in the interstitial fluid of tumor tissue. 

Pharmacodynamic Model for the Onco-Proteins 

Mechanisms of Raf-1 synthesis, transport, activation and depletion by 
ansamycins have been studied extensively at the sub-cellular and molec- 
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ular level (see Fig. 8). Because the three forms of measurable Raf-1 pro- 
tein were destabilized equally by ansamycins, they were modeled as one 
single pool of Raf-1 with an indirect response model. Although HSP90 
plays a key role in the regulation of Raf-1 stabilization and degradation, 
it was not included in the Raf-1 model, for the following reasons: 1) 
HSP90 is present in the cells in excess (1-2% of total proteins in the cell 
is HSP90), so that all the Raf-1 molecules are chaperoned by HSP90; 2) 
the binding of HSP90 to newly synthesized Raf-1 is essentially instanta- 
neous; and 3) the degradation of Raf- 1 is assumed to be instantaneous 
following the disruption of the HSP90-Raf-l complex by 17AAG and 
17AG. Although the transport and activation pathways of pl85 erbB2 are 
different from those of Raf-1, the action of ansamycin antibiotics on the 
degradation of pl85 erbB2 is the same as that of Raf-1 and therefore was 
also described using the same indirect response model. 

In the model, the synthesis of the onco-proteins was assumed to re- 
main constant following drug administration. However, it was reported 
that the synthesis of Raf-1 in geldanamycin treated cells was elevated 
approximately 3-fold [14]. In addition, in herbimycin A treated SKBr3 
cells (0.35 pM, 6 h), it was reported that pl85 erbB2 mRNA levels were 
elevated by 30% and the total protein synthesis decreased by 16% [29]. 
Although the synthesis of pl85 erbB2 protein was not measured directly, 
the above finding suggests the possible induction of pl85 erbB2 synthesis 
following ansamycin treatment. The mechanism of the synthesis induc- 
tions remains unknown, nor has there been any quantitative analysis of 
the process. With limited data and information, we were not able to 
model the synthesis induction. The discrepancy between measured and 
model predicted pl85 erbB2 concentrations might be a direct result of this 
inability. The data and model prediction suggest that after a period of 
depletion, the fast recovery of pl85 erbB2 is at least partially the result 
of the induced pl85 erbB2 synthesis. 

Pharmacodynamic Model of HSP Auto- Regulation 

The auto-regulation of the heat shock transcriptional response, dis- 
covered in the early 1990s is an example of a self-regulatory biological 
process. While the role of heat shock proteins as molecular chaperons is 
under intensive study, little quantitative information is available on the 
dynamics of HSP auto-regulation. For example, there have been no pub- 
lished reports of the half-life of HSP70 or HSP90, thus the model-based 
estimates obtained from our modeling work represent the first reported 
HSP70 and HSP90 turnover rates (HSP90 atj /2 — 40.43 hr and HSP70 
ti /2 = 4.51 hr). 
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There has been considerable debate regarding the pathways respon- 
sible for mediating the cellular stress signal in the heat shock auto- 
regulatory system [19]. One hypothesis states that external stress alters 
HSF1 directly, while a second postulates that HSP70 serves as the inter- 
mediary between cell stress and HSF1 activation. The model developed 
herein incorporates the later hypothesis, and assumes that HSF1 activa- 
tion increases linearly as HSP70 decreases below its control value with 
that it remains constant as HSP70 increases above the control value. 

The time course of HSPs concentration measurements in this study is 
not long enough to adequately reveal the dynamics of the processes asso- 
ciated with the return of HSP70 or HSP90 to their baseline values. The 
estimated parameter values and the model prediction of HSPs recovery 
reported await confirmation by further in vitro and in vivo studies. 

Bayesian Analysis 

Recent advances in PBPK modeling have further established the rel- 
evance of this approach in drug development. While the MAP Bayesian 
estimation approach used herein does not involve a mixed effects mod- 
eling formulation, it does provide an easily implemented method for 
incoiporating prior uncertainty in PBPK modeling. 

The use of Bayesian inference in model development exposes all parts 
of the model, including the model structure and the prior distribution, 
to appropriate criticism. An example of how the model development 
in the work reported herein benefits from the model criticism aspect of 
Bayesian inference is as follows. One of the models considered for the tu- 
mor uptake of 17AAG and 17AG was a 2-compartment diffusion-limited 
model consisting of an extra-cellular compartment and an intra-cellular 
compartment, i.e., assuming drug exchange across the “leaky” tumor 
vascular wall is instantaneous. This model would have been selected 
as the best model based on the AIC selection criterion, if the physio- 
logical parameters were fixed at their prior values and the kinetic pa- 
rameters were estimated by maximum likelihood estimation. However, 
the Bayesian analysis of this model resulted in unrealistic physiological 
parameter values, suggesting that this 2-compartment model structure 
is inappropriate. Indeed, by including separate vascular and interstitial 
spaces, a 3-compartment model with reasonable parameter estimates 
was established for tumor. The exchange of the compounds across the 
vascular wall was determined by a permeability area-surface product 
estimated at 0.23 ml/hr and 0.26 ml/hr for 17AAG and 17AG. 
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INTRODUCTION 

Population pharmacokinetic analysis attempts to obtain relevant phar- 
macokinetic information in treated subjects who are representative of 
the target population, quantifying variability between subjects and link- 
ing it to factors of demographic, pathophysiological, environmental, or 
drug-related origin. One promising method of analyzing population 
pharmacokinetic data is the expectation maximization (EM) algorithm, 
originally generalized by Dempster, Laird, and Rubin [1], and further 
developed by others [2-5]. The EM algorithm is particularly suited for 
complex PK/PD models with highly dimensioned inter-subject variances 
associated with the model parameters. The EM algorithm in its exact 
implementation calls for obtaining the expected value of certain quanti- 
ties over a probability density, which often cannot be calculated in closed 
form. As a consequence, most implementations of the EM algorithm for 
population PK/PD analysis approximate the expectation step using a 
linearization method [6,7]. This linearized form of the EM algorithm 
is called the iterative two-stage method [3], but it has also been called 
iterative two-stage Bayesian method (IT2B, [8]). If there are abundant 
data per subject, this l in earization yields reasonable values for the model 
parameters. However, if there are very few data per subject, such a lin- 
earization can introduce bias. 

Recently, various Monte Carlo methods have been developed which 
are suitable for conducting the expectation step with reasonable pre- 
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cision and accuracy [9,10]. Walker [11] has used one of these Monte 
Carlo methods to advantage and implemented an EM algorithm pop- 
ulation model. He demonstrated its utility for data in which the pa- 
rameters were distributed among a population of subjects by a multi- 
variate density with no covariances, and a simple homoscedastic intra- 
subject error model. We have further developed this method to include 
more efficient Monte Carlo sampling methods with more complex intra- 
subject error models. We demonstrate its use in complex pharmacoki- 
netic/pharmacodynamic models with parameters distributed by a multi- 
variate prior density with covariances, and we designate this method as 
Monte Carlo Parametric Expectation Maximization (MC-PEM). 

Nonparametric EM algorithms are also available and have the advan- 
tage of generality in that they do not constrain the distribution of param- 
eters among a population to a specific density function [8,12]. However, 
we have found that for most population PK/PD data, assuming a nor- 
mal density or a mixture of normal densities adequately describes the 
population that is studied, and allows for less ambiguous interpretation 
of the data [13], as well as greater speed in the analysis of the data. 
Furthermore, it is precisely this parametric parameter density paradigm 
that is utilized in the very popular NONMEM program [14], and is used 
most often in the pharmaceutical company setting, whereas the non- 
parametric methods are used most often by academic researchers. 

In this paper we provide a review of the equations needed to imple- 
ment the MC-PEM method. We also provide key examples that show 
the properties of this method, and compare them with NONMEM. We 
demonstrate that with proper software implementation one can analyze 
complex PK/PD models reliably, accurately, and quickly, requiring only 
vague initial parameter input by the user. While the statistical and 
theoretical validity of the EM methodology has been demonstrated in 
the literature, the purpose of this paper is to demonstrate the utility of 
the MC-PEM method for the population analysis of complex PK/PD 
models used in the pharmaceutical industry. 



THEORY 

Population PK/PD Data Analysis 

We describe here the two-level hierarchical theory of modeling popula- 
tion pharmacokinetic and pharmacodynamic data, followed by describ- 
ing the EM algorithm and how it can be used to fit such data. We limit 
the implementation of the MC-PEM method to the analysis of pharma- 
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cokinetic and pharmacodynamic data for which the model parameters 
can be assumed to be normally (or log-normally) distributed among sub- 
jects. For such a case, data from a subject consist of measured responses 
at selected times after administration of a drug, such as drug concen- 
tration in plasma, and/or some measure of pharmacodynamic response. 
A pharmacokinetic (PK)/pharmacodynamic (PD) model f(t,0 ) is pro- 
posed to describe the changes in response with time t , where 9 is a vector 
of parameters, such as volume of distribution, clearance, etc. A specific 
function is proposed for each response type, one for drug levels, another 
for PD response, another for efficacy, etc. Typically, the subject’s data 
is modeled to include intra-subject error as follows: 

Vi = f(U,9) + G 1/2 (U,6)£ (1) 

where yi is an m* x 1 vector of the m* total data collected at times U for 
subject i, f(U,9 ) is an m, x 1 vector of predicted values based on the 
family of essential PK and PD functions /(/, 9) specific for each data 
point, G(ti,9) is an mj x m* matrix of modeled (based on predicted 
values intra-subject variances and covariances to the data y t , 

and e is an tth x 1 vector of random values, normally and independently 
distributed with mean of 0 and variance of 1. The negative logarithm 
of the joint probability density for all of the m, data points for subject 
i, not including a constant term, is then (suppressing notation showing 
dependence on ti and 6): 

- log(i(S!il«)) = + \ log |G,| (2) 

where we shall designate l(yi\9) as the lik elihood or data density for data 
vector yi of subject t, given model parameters 9. 

In turn, the variability of the vector of parameters 9 among the pop- 
ulation is also assumed to be normally distributed, or perhaps some 
simple transformation of the parameters is normally distributed (such 
as via logarithmic transformation), with population mean vector /i, and 
population variance matrix ft . The negative logarithm of the probability 
density of a given 9 is therefore, not including a constant term, 

- log (/*(%, ft)) = \ log |ft| + \{9- ii)'Q.- l {9 - (3) 

where we shall designate h (B\y,, 12) as the parameter population density 
for 9, given /i and ft. Then, the joint probability density for some vector 
9 and a set of data yi is 



P(y*i%.ft) = l(yi\9)h{9\y,Q) 



( 4 ) 
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For a particular subject, the data yi is observed and therefore fixed 
throughout the analysis, whereas the parameter vector 9 describing the 
pattern in the data is unknown (and therefore considered “missing” in 
some discussions of the EM algorithm [5]) but imputable, based on the 
model and the observed data. It is therefore best to consider all possible 
values of 9, taking into consideration the probability of occurrence of 
each 9 for the particular population in question. To do so, we integrate 
the density over all possible 9 , producing the following contribution of 
the objective function by subject i: 

/ +oo 

l(yi\9)h(9\n,Cl)dO) (5) 

•00 

which is the negative logarithm of the marginal density of the data yi for 
subject i, up to a constant. The negative logarithm of the joint marginal 
density for all m subjects is then 



m 

L = -log(p(j/|/i,fi)) - ^2 Li 

i= 1 



171 /'+oo 

5>g(/ l(yi\e)h(9\y,n)d9) (6) 

1 J - 00 



where L is the total objective function. 

Linearized Approximation Method of Analysis 

To find the set of mean population parameters /i and population vari- 
ance Q that best fits the data from m subjects, one maxi mi zes the above 
marginal density of y with respect to ji and Q, by minimizing L. How- 
ever, the integration steps needed to evaluate L are computationally 
expensive. The program NONMEM, first order conditional estimation 
method (FOCE, [14]) minimizes an alternative objective function, which 
is a linearized approximation to 2 L: 



Ln = -2 log (p(Vu ft)) - JZ lo g 



1=1 



1 = 1 



B, 



( 7 ) 



where §i is the mode of the joint density for each subject i, and B t 
is the first order approximation to the variance of the parameters over 
the joint density. As we shall see in the examples, this approximation is 
reasonable when plenty of data are available for each subject (rich data), 
but leads to biased values when there are few data per subject (sparse 
data). 
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The EM Algorithm and Monte Carlo Integration: 
MC-PEM 

The EM algorithm is another means by which the objective function 
L may be minimized. It can be shown that if the parameter population 
density h(9\p, Cl) is of the form of a multivariate normal distribution 
with respect to 9 (or some transformation of 6), then at the minimum 
of the objective function the following relationships hold [4]: 





i m 

/* = — xi ^ 

m *r~i 

t=i 


( 8 ) 




1 m 

n = -YCii 

m ^ 

»=t 


( 9 ) 


where 


rOO 






9i= 0z(0\yi, p,tt)d0 

J — OO 


( 10 ) 


is the conditional mean 


9 vector for subject i, 






f oo 

(9 - p)(9 - pYz(0\yi,p , Q)d0 

-oo 


( 11 ) 



is the contribution to the population variance from each subject i, and 



z(9\yi,fi,n) 



l{yi\6)h{0\n,n) 

J!SoJ(wlW%.n)d0 



( 12 ) 



is the conditional density of 0, given data yi, and population parameters 
p and Cl. An equivalent form of Eq. (11) is: 

Cli = ( 1 9i - n)(0i - p)' + Bi ( 13 ) 



where 



Bi 



/ +oo 

(9-0i)(0-0 i Yz(0\yu^n)d9 

-oo 



( 14 ) 



is the conditional variance matrix of 9 for subject i. 

The above equations suggest that one may maximize p(y\p, Cl) with 
respect to the population parameters p and Cl by first evaluating the 
conditional mean 0 { by Eq. (10) and the conditional variance Bi by Eq. 
(14) for each subject i, using fixed values of p and Cl (the expectation 
step), followed by evaluating updates to p and Cl using Eqs. (8) and (9) 
(the maximization step [4]). The EM update Eqs. (8) and (9) are specific 
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for a parameter density h(0\fi, Q) that is normal, but are generally true 
for any data density l(yi\Q) , such as, log normal distribution, logistic 
regression, etc. For purposes of this paper, we focus on that are 

normal densities with respect to the observed data j/j. 

Additionally, we would like to evaluate the integrations given in Eqs. 
(10) and (14) more precisely while avoiding complicated integration algo- 
rithms. Monte-Carlo integration methods allow one to randomly sample 
over the entire space of 0, and then calculate a weighted average of the 
quantity of interest, converting the theoretical integration step into a 
practically evaluated summation step: 



^ £>(((#<»,), A(«, t) )) 

fc=l 

Theoretical Expression Practical Evaluation 

and 

J'ZoP(y^ e \^ n ) de 

Theoretical Expression 

(0(k) ~ #i)(0(k) ~ 0i)'w(l(0(k)),h(0(k))) 

— ? (16) 

Zw(l(0( k) ),h(0 {k) )) 

k = 1 

Practical Evaluation 

where the weight w depends on the parameter density h(0^) (suppress- 
ing dependence on p and O), data density, 1(9 (*)) (suppressing depen- 
dence on yi ) and the method of Monte Carlo used, for r randomly gen- 
erated vectors of 9^y Appendix A1 gives the weight equations for two 
methods that we have used, the direct sampling and the importance 
sampling techniques. The value of r required depends on the desired 
precision of the population parameter estimation, the type of data, and 
the type of PK/PD model. Typically, 1000 to 3000 randomly generated 
samples of 9^) are needed to obtain approximately three significant dig- 
its of precision in the conditional mean and variance parameters. 

Covariate parameters 

The population parameters fj, may in themselves be modeled from 
covariate parameters c, which link them to characteristics (covariates) 
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of the subject. For example, volume of distribution V c may be modeled 
to age: Va = c s * agti + q,. In this case, c must be varied to maximize 
the objective function, rather than the m, which now varies with each 
subject. For the EM method, this means modifying the maximization 
step, as described in Appendix A2. 

Intra-subject variance parameters 

The objective function may be further optimized by a set of intra- 
subject variance parameters that are unknown but fixed between sub- 
jects. For example, the intra-subject error may be modeled as 

9 ij=cr 2 f{tij,ey (17) 

where pjj is the intra-subject variance of the jth data point of subject 
i , and f(Uj,0) is the predicted value at time Uj. In this example, the 
parameters a 2 and 7 are the intra-subject variance parameters, and may 
be referred to generally by the vector v. The SIGMA intra-subject error 
parameters of NONMEM are v parameters. Appendix A3 describes a 
general Monte Carlo method for optimizing the objective function for 
these intra-subject variance parameters. 



HARDWARE AND SOFTWARE 

All computations were performed on a PC with 1 Gigahertz CPU 
speed, 512 MB RAM, using the Windows 98 operating system. The 
ADAPT II, Release 4 program [15] was modified to perform the MC- 
PEM population analyses, utilizing the mathematical algorithms that 
were available from the original software. We used the Nelder-Mead al- 
gorithm supplied by ADAPT II for all of the deterministic optimizations. 
Digital Visual Fortran (Compaq Corporation) was used as the compiler. 
The program NONMEM version V, level 1.1 [14] by Stuart Beal and 
Lewis Sheiner was obtained from U. California, San Francisco. The ran- 
dom number generator used for the generation of data and analysis of 
MC-PEM was ^R AN 3 from [16]. 
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EXAMPLES AND RESULTS 

Example 1: Test of Bias and Precision of MC-PEM 
Method 

Bias and precision of the direct sampling MC-PEM method under 
sparse data conditions were tested by creating 50 data sets with 1000 
subjects each, one data point per subject. We began with the structural 
model 

/ (it) = 100 * exp (-kxo * t)/V c 

where fc 10 is the elimination rate constant and V c is the volume of dis- 
tribution of the central compartment, with intra-subject variance: 

9i = v 2 * f{t) 2 

and a — 0.2, fixed. The simulation of the 50 sets of data was performed 
using the following attributes for each data set: 1000 subjects (simu- 
lated vectors of PK parameters), 1 PK observation per subject at 0.1, 
3, or 15 units of time, 1 bolus dose of 100 units given to each subject. 
Each parameter (PK) was simulated to follow a lognormal distribution 
among the population. The population parameters were generated us- 
ing population values F c =70, AqO^O.2, Var(V c )=0.3, Var(fci0)=0.7, and 
Covar(V c , feio)=-0.4. For each data set of 1000 subjects an MC-PEM 
analysis was performed. 

For each EM iteration, 1000 random samples of parameters were gen- 
erated for each subject, to determine their conditional mean and vari- 
ance by the direct sampling method. The analysis was allowed to run for 
25 EM iterations for each data set. From these 50 MC-PEM solutions 
(one for each data set), imprecision and bias statistics were evaluated as 
shown in Table 1, and also plotted as scatter grams in Fig. 1. 

The reference means are those of the simulated values actually gener- 
ated for each of the 50 data sets. The mean estimates are of the 50 sets 
of estimates of the population parameters generated by the MC-PEM 
analysis. The empirical coefficient of variation (CV) among the 50 MC- 
PEM population parameter estimates represents the imprecision of the 
MC-PEM method. The Fischer score coefficient of variation is the aver- 
age percent standard error of the population parameters among the 50 
data sets. The Fischer score standard error was evaluated based on the 
empirical observed Fischer information matrix described on p. 129 of [17] 
and Eq. (4.41) of [18]. For the Fischer score error to be useful, it should 
be similar to the empirical CV of the estimates. The confidence index 
is the percent of the 50 estimates that were within 2 Fischer score stan- 
dard errors of the reference parameter values. For normally distributed 
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liable 1. Test of bias and precision of MC-PEM (example 1). 





Vc 


&10 


1 Var(V c ) 


Covar(fcio,V c ) 


Var(fcio) 


Mean of Sample References 
(known) 


70.0 


0.200 




-0.398 


0.696 


Mean of Estimates (found) 


69.9 


■i»w 






0.699 


Empirical CV of Estimates 
(%) 


2.40 


4.80 




8.72 


8.77 


Fischer Score CV of Esti- 
mates (%) 


2.30 


3.75 


7.04 


8.29 


tm 


Percent Bias 


-0.0961 


0.515 


0.226 


ESEjMStlif 




Standard Deviation of Per- 
cent Bias 


1.30 


2.76 


5.43 


6.65 


7.08 


Confidence Index (%) 


too 


100 


100 


100 


100 



Each of fifty simulated data sets was analyzed by MC-PEM, yielding estimates for 
each of the population parameters and variances, as described in example 1 in the 
text. The summary statistics listed in the table were calculated from the estimates 
and reference values in accordance with equations listed in Appendix A4. The Con- 
fidence index is the percent among the fifty analyses that yielded estimates within 
two Fischer score standard errors of the reference values. 



estimates, one would expect that about 95% of the estimates fall within 
2 standard errors of the true values. This index was calculated because 
users of curve fitting software often expect, as a rule of thumb, that the 
estimate should be within two standard errors of the true value, whether 
or not the estimates are in fact normally distributed. The equations for 
calculating each of these statistics are given in Appendix A4. 

As shown in Table 1, the percent Bias was less than 1% for all of 
the population parameters, and much less than the empirical coefficient 
of variation of 3-8%. Furthermore, the average Fischer score CV was 
very similar to the empirical coefficient of variation, suggesting that the 
Fischer score method is reliable for standard error estimation of the 
population parameters. The confidence index, evaluated from the 50 
data sets, was 100%, suggesting that for any given data set, the user may 
accept with a high degree of confidence that the estimated population 
parameters are within two estimated standard errors of the true values. 
Similar results were obtained when importance sampling was used. 

For NONMEM conditional estimation method with interaction (Ta- 
ble 2 and Fig. 2), the average population V c among the 50 data sets was 
upward biased by 11% and the fcio was upward biased by 8.3%, which 
are 3-4 fold greater than the CV of the estimates. The inter-subject 
variances were 0.5% biased for Var(V^), and 9-10% biased for Var(klO) 
and Covar(l4,jfcio). Fischer score standard errors from NONMEM were 
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Bias and Precision of MC-PEM 




Replication Number Replication Number 




Replication Number 




Replication Number 




Fig. 1 . Bias and precision of MC-PEM results obtained from the 50 simulated data 
sets for example 1. The percent difference of each population parameter estimated 
using MC-PEM was then calculated, relative to the known simulated mean of the 
data set (the reference value, see Appendix A4) and plotted against the data set 
(or replication) number. The percent bias is the average percent difference from 
reference (Appendix A4), indicated by the solid line, and is to be compared against the 
dotted zero bias line. The dashed lines indicate the empirical 2.5 and 97.5% quantile 
positions, representing the scatter of the values. See Table 1 for the numerical values 
of the percent bias and percent imprecision. 





%Difference from Reference %Difference from Reference %Difference from Reference 
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Bias and Precision of NONMEM, FOCE 




Replication Number Replication Number 




Replication Number Replication Number 




Fig. 2. Bias and precision of NONMEM with FOCE results obtained from the 50 
simulated data sets for example 1. See Table 2 for the numerical values of the percent 
bias and imprecision and the Fig. 1 caption for further description of the graphs. 
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Table 2. Test of bias and precision of NONMEM, FOCE (example 1). 





1 Yi . 


| fcio 


Var(Vl) 


| Covar(fcio,V c ) 


Var(fcio) 


Mean of Sample References 
(known) 


m 




0.301 




0.699 


Mean of Estimates (found) 


77.9 


0.216 


0.299 




0.635 


Empirical CV of Estimates 
(%) 


2.79 


4.43 


9.60 


10.5 


9.25 


Fischer Score CV of Esti- 
mates (%) 


2.54 


3.65 


7.07 


7.96 


7.77 


Percent Bias 


11.0 


8.30 


-0.512 


10.2 


-9.15 


Standard Deviation of Per- 
cent Bias 


2.10 


3.33 


9.00 


8.91 


7.64 


Confidence Index (%) 


0 


34 


90 


72 


78 



Each of fifty simulated data sets was analyzed by NONMEM’s first order conditional 
estimation (FOCE) method, yielding estimates for each of the population parameters 
and variances, as described in example 1 in the text. The summary statistics listed 
in the table were calculated from the estimates and reference values, in accordance 
with equations listed in Appendix A4. Fischer score standard errors from NONMEM 
were obtained using the MATRIX=S switch. The Confidence index is the percent 
among the fifty analyses that yielded estimates within two Fischer score standard 
errors of the reference values. 



obtained using the MATRIX=S switch. The confidence index for the 
population parameters were 34% or lower, meaning that the population 
estimates were typically more than 2 reported standard errors from the 
true values. Thus, a user can be misled as to the location of the true 
values of the population parameters when analyzing this type of data 
set with NONMEM FOCE. The problem was run again by using the 
Laplace conditional estimation method (Table 3 and Fig. 3, using the 
2LL” option in NONMEM), which resulted in a reduction of bias in the 
population mean parameters to 2-3%, but the biases in the population 
variances increased to 9-32%. Only 34 of the 50 data sets were success- 
fully analyzed by the NONMEM Laplace method. The traditional first 
order (FO) method yielded values similar to the Laplace method: 3-4% 
bias in the population mean parameters V c and fc 10 , and 10-34% bias in 
the population variance parameters (data not shown). However, NON- 
MEM was able to analyze a data set in 30 seconds or less, whereas the 
MC-PEM method took about 15 minutes. We typically found that for 
simple pharmacokinetic models, NONMEM was 10-30 fold faster than 
MC-PEM. 
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Bias and Precision of NONMEM, Laplace 





Replication Number Replication Number 




Replication Number 



2.5%, 97.5% Quantiles 

— Bias 



Fig. 3. Bias and precision of NONMEM with Laplace results obtained from the 
50 simulated data sets for example 1. See Table 2 for the numerical values of the 
percent bias and imprecision and the Fig. 1 caption for further description of the 
graphs. Only 34 of the 50 data sets were successfully analysed by the NONMEM 
Laplace method. 
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Table 3. Test of bias and precision of NONMEM, Laplace (example 1). 





V e 


k 10 


1 Var(V c ) 


Covar(A'io,E:) 


Var(fcio) 


Mean of Sample References 
(known) 


70.1 


0.200 




-0.398 


0.699 


Mean of Estimates (found) 


72.2 


0.204 


0.271 


-0.270 




Empirical CV of Estimates 
(%) 


4.63 


6.36 


15.8 


17.9 


33.7 


Fischer Score CV of Esti- 
mates (%) 




2.61 


4.31 


3.70 




Percent Bias 


2.98 




-9.44 


32.1 


-29.4 


Standard Deviation of Per- 
cent Bias 


4.34 


6.28 


14.4 


12.5 


21.5 


Confidence Index (%) 


53 


17 


47 


0 


0 



Each of fifty simulated data sets was analyzed by NONMEM’s Laplace conditional 
estimation method, yielding estimates for each of the population parameters and 
variances for 34 successful analyzes, as described in example 1 in the text. The 
summary statistics listed in the table were calculated from the estimates and reference 
values, in accordance with equations listed in Appendix A4. Fischer score standard 
errors from NONMEM were obtained using the MATRIX=S switch. The Confidence 
index is the percent among the fifty analyses that yielded estimates within two Fischer 
score standard errors of the reference values. 



Example 2: Complex PK/PD model 

To illustrate the functionality and robustness of the methodology, we 
simulated data from a two-compartment PK model linked to a Michaelis- 
Menten PD model. Such a model is sufficiently complex to demonstrate 
the utility of EM methods, as there are a total of 44 population mean and 
inter-subject variance-covariance parameters for this model. A diagram 
defining the model and the parameters are given in Fig. 4, with the 
equations describing the system in Appendix A5. 

The simulation of the data was performed using the following at- 
tributes: 500 simulated vectors (subjects) of PK/PD parameters, 3-4 PK 
observations per subject (sparse data), 34 PD observations per subject, 
a bolus dose of 500 units given to each subject, each parameter (PK) 
was simulated to follow a lognormal distribution, intra-subject error for 
drug levels was modeled with constant proportional error, and the PD 
response was modeled with constant error. The population mean and 
variance values generated for this sample data set are listed in Table 4. 

The initial values of the population parameters were set at the begin- 
ning of the MC-PEM analysis to 1.1. These values were arbitrarily differ- 
ent from the simulated values of w c =50; fci2=0.94; &2i=0.313; fcio=1.4; 
k eo — 0.1; C eo 5O =0.02; E max = 12; 7=1.5. Diagonal elements (variance) 
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liable 4. Reference (simulated) population means and variances generated for the 
sample data set of the complex PK/PD model with sparse data (example 2). 



Parameter 


Mean 


Pc 


50.1 


k\2 


0.956 


&21 


0.319 


kio 


1.39 


keo 


0.100 


Cc50 


0.020 


Emax 


12.4 


7 


1.59 


ffpx (proportional error) 


0.25 


cpD (constant error) 


0.5 





Pc 


k \ 2 


kn 


fclO 


fcco 


Cc50 


Emax 


7 


Vo 


0.234 
















k\2 


0.00483 


0.0822 














&21 


-0.00545 


0.0381 


0.309 












k\o 


-0.338 


0.00753 


0.0282 


0.528 










keo 


0.00593 


0.0312 


0.204 


0.0266 


0.421 








CeS 0 


0.00489 


0.0530 


0.113 


-0.00545 


0.0944 


0.563 






Emax 


0.00255 


0.0495 


0.120 


0.0363 


0.103 


0.524 


0.571 




7 


0.0111 


0.0679 


0.0967 


-0.112 


0.517 


0.183 


0.0805 


0.493 



Simulated geometric means and variances of the parameters far 500 subjects using a 
two-compartment PK linked to a sigmoidal Emax PD model are shown. The model 
is described in Fig. 4 and Appendix A5. 



of the population variance-covariance matrix were set to 3, off-diagonal 
elements (covariance) were set to 0. Rather than listing the 44 resulting 
population parameters and variances for each analysis method. Table 
5 describes the overall experiences and bias results of the fits. The 
MC-PEM method using the importance sampling method took 100 it- 
erations beginning with the initial parameters given above, with 3000 
random samples per subject generated for the expectation step, before 
none of the parameters changed consistently from one iteration to the 
next, and it took about 4 hours to complete the analysis. No user inter- 
vention was required in the process. All but 1 of the MC-PEM estimate 
values were within 2 standard errors of the simulated sample: the pop- 
ulation estimate of the parameter 7, the slope to the Emax model of the 
PD response, was 2.2 standard errors from the known simulated mean 
value. 
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Table 5. Optimization experiences and results for complex PK/PD model with 
sparse data (example 2). 





MC-PEM 


NONMEM 


Total time (hours) 


3.8 


8.5 CPU/ll user 


No. of interventions 


0 


5 


No. of parameters with bias 


1 : 7 


1 : 7 


No. of population variances with bias 


0 


3^0*10,7 


No. of population covariances with bias 


0 


5:/uiO"Vc ? ^max"&21 ? 
7-kl2,7-fc21 ,7 -Emax 


Total number of biased estimates 


1 


9 



General results of the population analysis of simulated data using a two-compartment 
PK linked to a sigmoidal Emax PD model are shown. The model is described in Fig. 
4 and Appendix A5. Bias was assessed as greater than 2 standard errors from the 
reference sample mean value. Of those that were more than 2 SE from sample mean, 
none were greater than 3 SE for MC-PEM, 1 was greater than 3 SE for NONMEM. 



In executing the NONMEM program, errors were reported on the 
first iteration using the initial values that were used in the MC-PEM 
analysis (except that the off-diagonal elements were initially set to 0.05). 
Therefore, the results after the 5th iteration of the MC-PEM method 
were instead used as initial parameters, and NONMEM was able to 
continue for 5 hours before reporting another error. The final parameters 
of an incomplete run were used as initial parameters for a new run, with 
slight modifications. This method of restarting the process was done 
5 times, 2 of which were not useful in reducing the objective function 
value. The total computation time was 8.5 hours, as listed in Table 5, 
and 11 hours if we include time spent by the user to re-start each run. 
Even at the last run, NONMEM reported an error. But, as the result 
was sufficiently close to the known answer, the values of the last run 
were evaluated. This exercise clearly indicated the lack of robustness of 
the NONMEM method for this type of example. Nine of the parameters 
were more than 2 standard errors from the true sample means. 

Example 3 

The following example demonstrates the ability of the MC-PEM meth- 
od to analyze data in which the PK parameters are modeled with respect 
to a subject’s characteristics, and also shows the ability to obtain the 
intra-subject variance parameters. The model was defined as a simple 
one-compartment model: 



/ (t) - 100 * exp (-k w * t) /V c 
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A Complex PK/PD Model 




Fig. 4. A Complex PK/PD model. Population data were simulated in accordance 
with the model shown in the figure. The plasma concentration of the drug was sim- 
ulated using a two compartment model with first order distribution rate constants 
kl2 and k21 between the central and peripheral compartments, rate constant of elim- 
ination klO, and volume of central compartment V c . The concentration of drug in 
the effect compartment C e (t) was modeled as driven by the plasma concentration of 
the drug via the delay link rate constant k eo , which in turn determined the pharma- 
codynamic effect E(t) modeled as the sigmoidal Emax model, with maximal effect 
E max , half-maximal effect concentration EC50, and slope factor 7. The specific model 
equations are given in Appendix A5. 

where V c and /c 10 are modeled to the subject i’s age: 

Vd = AV * agef v 
ki 0i = AK * agef K 

and the in tra- subject variance was modeled as g% = o 2 * /(<) 7 where <7 
and 7 are fixed among the subjects ( v parameters), but unknown. 

The simulation of the data was performed using the following at- 
tributes: 100 subjects (simulated vectors of PK parameters), 6 PK ob- 
servations per subject (rich data in order to evaluate intra-subject er- 
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Table 6. Results of MC-PEM analysis of covariate model with rich data (example 
3). 



Parameter 


Sample (Reference) 


Estimated 


SE 


%CV 


AV 


4.48 


4.79 


0.77 


16.2 


BV 


0.81 


0.795 


0.043 


5.45 


AK 


0.00236 


0.00249 


0.000586 


23.5 


BK 


1.25 


1.229 


0.0595 


4.84 


(T 


0.2 


0.1928 


0.0064 


3.32 


7 


2 


1.97 


0.04 


1.86 


Var(Vc) 


0.06 


0.0553 


0.0118 


21.4 


Cov(kiOV c ) 


-0.02 


-0.0236 


0.0086 


36.7 


Var(fciO) 


0.07 


0.0645 


0.0122 


18.8 



Data were simulated using a one compartment model with population volume of 
distribution and rate constant of elimination varying as a power function with age, as 
described in example 3 in the text. The parameters to the power function for volume 
were AV for proportionality coefficient and BV for the power. The parameters to the 
power function for rate constant of elimination were AK for proportionality coefficient 
and BK for the power. The intra-subject random error in the data was modeled as 
a heteroscedastic power function, with coefficient sigma and power gamma. Inter- 
subject variance elements are Var(Vc), Cov(klO,Vc), and Var(klO). Results from 
NONMEM FOCE were similar. 



rors), a bolus dose of 100 units given to each subject, each parameter 
(PK) was simulated based on a lognormal distribution. 

For the analysis, one thousand random samples per subject were 
generated during the expectation step using the importance sampling 
method, and about thirty iterations were required to reach convergence, 
when begun at starting values of 1.1 for all population parameters, di- 
agonal population variances=2.0, off-diagonal variances=0, sigma=0.5, 
and gamma=1.0. 

The analysis results are shown in Table 6. The NONMEM program 
could not complete the problem at these initial values. Only after NON- 
MEM was given initial parameters that were close to the answer could it 
converge to an error- free result. The NONMEM method yielded similar 
values to those of the MC-PEM method. 



DISCUSSION 

The results of Example 1 demonstrate that under sparse data condi- 
tions, the linearized approximation methods such as NONMEM can pro- 
duce considerable bias in the population parameters and/or population 
variances, although it was 10-30 fold faster than the MC-PEM method. 




153 



Linearized approximation methods use the mode 6 X of the joint density 
l(yi\9)h(8\fi,Q) of the parameters, and the first order approximation 
to the conditional variance, B x , instead of the more proper conditional 
mean 0 X and conditional variance B, , respectively. Only when the joint 
density is truly normally distributed with respect to the parameters do 
the relations 6 t =9 l and B x = B % hold true. Under these conditions, L/v 
of Eq. (7) is exactly equal to 2 L of Eq. (6). This occurs when the model 
function / (t, 0) is a linear function of the parameters 9, and the intra- 
subject error is homoscedastic. We have found that when there are very 
few data per subject, the joint density tends to deviate from the nor- 
mal density considerably. The more data for a given subject, the more 
normal-like the joint density appears with respect to the parameters. It 
should be noted that the iterative two-stage method [3], which uses the 
mode 9{ and approximate variance B % in an EM algorithm, also leads 
to biased estimates of population parameters and variances for exam- 
ple 1, although the values differ from that of NONMEM FOCE. Thus, 
evaluating the conditional means and variances using the Monte-Carlo 
integration method, given a sufficient number of random samples, results 
in unbiased values for the population parameters and variances. 

It should be noted that certain study designs can result in a biased 
assessment of the data regardless of the statistical analysis method used, 
hi this example we selected a set of time points that were reasonably 
distributed to cover most of the V c and fcio values that would occur 
among the population. That an analysis method, in this case MC-PEM, 
was able to obtain the population parameters with less than 1% bias 
demonstrates that the simulated data sets were based on a balanced 
study design. If the range of time points were too narrowly selected, 
any analysis method, including MC-PEM, would result in biased values. 
Thus, a proper study design is also important in obtaining unbiased 
estimates. 

It has been pointed out to us that the type of random number genera- 
tor used to simulate the data could affect the apparent performance of a 
particular method. We repeated these analyses by generating data using 
several types of random number generators, as well as from NONMEM’ s 
simulation step, and similar results were obtained: typically < 2% bias 
in the population mean and variance parameters for MC-PEM, and con- 
siderably greater bias from the various NONMEM methods (data not 
shown). 

The results of Example 2 show that the MC-PEM is more efficient and 
has fewer problems in converging to the final estimate when analyzing 
population data with complex PK/PD models. It is the simplicity of 
the maximization step of the EM algorithm that allows for this kind of 
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robustness. Traditional optimization methods have difficulty optimizing 
an objective function that is as complex as those for a population analy- 
sis problem, particularly when population variance parameters must also 
be estimated, and programs that rely on such techniques have difficulty 
converging when the PK/PD model is complex. We found this to be 
true of the mixed effects program in S-PLUS [19], as well as WinNon- 
Mix [20]. To aid in the convergence, it is recommended that some or 
all off-diagonal elements of the inter-subject variance-covariance matrix 
be constrained, which tends to lead one to make arbitrary constraints. 
In contrast, the stability of the MC-PEM method is not affected by size 
of the inter-subject variance matrix, and easily allows the evaluation of 
all of its elements. After performing the MC-PEM analysis, compari- 
son of the variance element with its standard error will reveal to the 
user its true importance to the fitting of the data. This is preferable 
to fixing that element before the analysis, which can introduce bias. 
The most complex model that we have tested to date consisted of real 
clinical data using a PK/PD/efficacy model containing six differential 
equations, 16 population parameters, 4 intra-subject error coefficients, 
and 136 variance-covariance parameters. Three differential equations 
were used to describe the pharmacokinetics of the drug, with drug in- 
puts by intravenous and subcutaneous routes, and Michaelis-Menten ki- 
netics. The PD component used 2 differential equations, and reduction 
of PD marker by drug was modeled using indirect response models I 
and IV [21]. Finally, one differential equation was used to describe the 
disease related to the PD marker, offset by first order rate of healing. 
The MC-PEM successfully reached convergence in fifty iterations with 
no user interruptions required, although it took four days because of the 
required numerical integration of the model equations. 

The MC-PEM method is readily extended to analyze covariate pa- 
rameters and in tra- subject variance parameters. Furthermore, standard 
errors of the population parameters can be easily calculated after a given 
run using the empirical information matrix [17,18]. The method can be 
made to be as flexible as the NONMEM method in allowing the user to 
design a wide variety of inter- subject and intra-subject variability models 
to describe population PK/PD data. We have even determined a way to 
model inter-occasion variability [22] using the MC-PEM method, which 
is outlined in Appendix A6. Additionally, we have modified l (yi\0) to 
take into account data that are below the quantification limit of the as- 
say, using the M3 method described by Beal [23]. Simulations with as 
much as 10% of the data below the quantifiable limit followed by MC- 
PEM analysis yielded unbiased estimates of the population parameters. 
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Our implementation of MC-PEM also calculates the objective function 
that is suitable for hypothesis testing. 

Despite NONMEM’s bias in sparse data problems and its lack of sta- 
bility with complex PK/PD problems, NONMEM is faster than MC- 
PEM and just as accurate when the data per subject are rich and the 
PK model is relatively simple, such as a one or two compartment model. 
NONMEM is also efficient at extensive covariate modeling of the popula- 
tion parameters and statistical testing of relevant subject characteristics. 
It is with these kinds of models that NONMEM has been most success- 
fully used. Furthermore, typical of deterministic methods, NONMEM’s 
objective function is calculated to a higher precision than Monte Carlo 
methods, which allows for a clear stopping criterion, unlike our MC- 
PEM method, which at present requires monitoring of the change in the 
population parameters by the user, and thus a somewhat subjective as- 
sessment of convergence, although more objective stopping criteria have 
been suggested [18]. 

In summary, the MC-PEM method offers a practical alternative to 
population analysis for situations where NONMEM tends to fall short, 
namely, for sparse data and/or complex PK/PD models. With the im- 
provements described above implemented in appropriate software, we be- 
lieve the MC-PEM method can be an important addition to the current 
population analysis arsenal. The implementation of the MC-PEM as 
outlined in this paper is being evaluated for incorporation into ADAPT 
II for general distribution. 
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APPENDIX A1 



Weight Equations for Direct Sampling and Importance 
Sampling Monte Carlo Method 

For the following discussion, we define the normalized weight to a 
random sample k for data from subject i as 



u (k)i ~~ 



E «W(fc))»M0(fc))) 

k = 1 



(18) 



Direct Sampling 

Direct sampling for an EM algorithm has been described in section 
4.1 of [91 and utilized in a pharmacodynamic population problem by 
Walker [11]. This simplest of Monte Carlo techniques is to sample from 
the parameter density and weight in proportion to the subject’s data 
density, so that 



u (k)% = 



Kyi\Q(k)) 

E Kvt\9(k)) 

k = 1 



(19) 



is the normalized weight for the kth random sample vector of r total 
samples. 



Importance sampling 

The MC-PEM using direct sampling can be inefficient for certain prob- 
lems. Typically, 30000 samples or more per subject are required in 
sampling from the parameter density. An alternative method uses im- 
portance sampling, but requires a greater degree of programming. We 
utilize an envelope or sampling function p e (9) from which one obtains 
sample values of 9. Then, the normalized weight to use for each random 
sample vector is: 

l(y i \9)h(9\n ) n)/pe(9) , nnS 

u (k)i ~ (20) 

E l(ViW)h(0\t*,to)/Pe(9) 

k= 1 

One reasonable envelope function is the multivariate normal density that 
approximates the joint density, l(yi\9)h(9\p, Q), although a multivariate 
t-distribution may also be used to efficiently cover long tailed joint densi- 
ties [10]. This multivariate density may utilize the mode of the subject’s 
joint density as its mean, and an estimate of the variance-covariance 




159 



based on the first order information matrix. Such a method requires the 
maximization of the subject’s joint density using a deterministic opti- 
mization algorithm (such as Nelder-Mead or Quasi-Newton methods). 
Alternatively, after the first EM iteration, the multivariate normal den- 
sity using the conditional mean and conditional variance of each subject 
from the previous EM iteration may serve as a sampling function for the 
present iteration, thus no longer requiring the use of a search algorithm 
to find the mode of the subject’s conditional density. 



APPENDIX A2 

Evaluating Covariates in the EM Algorithm 

In general, the population parameter // may be a function of covariate 
coefficients c and covariate data X{ for a given subject: 

m = ( 21 ) 

For EM methods, a least squares evaluation of the objective function 

m 1 

^-(^-MO'n-^i-Mi) (22) 

t=i 1 

with respect to the covariate parameters c serves as the maximization 
step. This replaces the update (maximization) of Eq. (8), and was 
recommended by Mentre and Gomeni (Eq. (13) of (7]). The inter- 
subject variances are updated (maximized) in the usual manner by Eq. 
(9). To carry out the estimation of the covariate parameters, the Oi 
obtained from the expectation step are made available as fixed data to 
the least squares routine in the maximization. Thus, the optimization of 
the objective function in Eq. (22) with respect to the covariates requires 
little computation time compared to the expectation step, which had to 
generate the by performing Monte Carlo simulations and calculating 
a possibly complex PK/PD model. 



APPENDIX A3 

Monte Carlo Assessment of the Intra-Subject Variance 
Parameters 

Intra-subject variance parameters v are associated only with the sub- 
ject’s data density, and not the parameter density. Thus, to optimize 
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the population objective function in v , we need differentiate only the 
data density portion l(yi\6,v): 



dL 

dv 



™ pec 

^2 / {-d\og(l(y i \e,v))/dv]z(e\y i ,n,n } v)dd 

1 J-OQ 



(23) 



which for the Monte Carlo implementation of the EM method leads to 
the practical evaluation during the expectation step of: 



= E £ “<*)4 ir ( G >' fr 0 -"' t ft - (» - /<)(*» - /<)'] ) (24) 

J i=l fc=l v r / 



for element i of the vector dL/dv, where u^)i is the normalized weight 
of the A;th randomly generated sample of vector 0. Typically, the intra- 
subject variance matrix G is modeled as a simple function of the PK/PD 
model /and the variance parameter v, where /does not depend on any 
parameters in v. In such cases, can be evaluated numerically by 

the finite difference method by varying only v, while keeping/, which 
varies only with 6^)i fixed. This saves considerable computation time, 
since the PK/PD model / can be very complicated. 

The second derivative matrix may be approximated by 




for element j i, j 2 of the Hessian matrix H v> so that after evaluating for 
all of the subjects a Gauss-Newton update may be implemented for the 



maximization step: 



Vnew — v old 



( 26 ) 



serving as the maximization step for the v parameters. 

The above method is only useful for fixed parameters that are as- 
sociated with the intra-subject variance function. However, it extends 
the EM algorithm to the accurate evaluation of variance parameters for 
intra-subject variance functions of any complexity, without “linearizing” 
the problem (such as Eq. (17) of [7]). Schumitzky [4] recommended a 
general method of updating any fixed parameter that may belong to the 
PK/PD model or to the intra-subject variance model that does not have 
a parameter density associated with it. This consists of modeling these 
parameters as having a common but unknown mean among the sub- 
jects, with very small inter-subject variance. In our experience we have 
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found for one example that fixing the inter-subject variance to a value 
of 15% of the mean of this parameter allowed the mean of the parameter 
to improve with each iteration, while allowing only small variations in 
value between subjects, thus ensuring that the parameter has a nearly 
common value among all of the subjects. This general method yielded 
the correct parameter value, but required over 100 iterations to achieve 
convergence. The method we have shown above does not require the 
use of a nuisance parameter (the small variance attached to the param- 
eter of interest), and required about 30-40 iterations for the particular 
problem that we studied. We thus believe our method above is a useful 
improvement. 



APPENDIX A4 

Precision and Bias Statistics for Tables 1 through 3 of 
Example 1 

In the following definition, Ei represents the population mean or vari- 
ance parameter estimated by MC-PEM or NONMEM for the ith data 
set, and Ki the known simulated geometric mean population parameter 
(reference) or the known variance for the ith data set, and n the total 
number of data sets. The following quantities are used to analyze the 
results presented in the example. 

The mean of sample references: 

* < 27 > 

1=1 

where g(x) = ln(x) for population parameters V c and k\o, and g(x) = x 
for population variances Var(V^), Var(fcio), and Covar(E c , k\o). 

The mean of estimates: 

E = g- l (^jts(E<)) (28) 

i=l 

The empirical CV of estimates: 

i/e («(£y -s(£)) 2 /(n- 1) 

100 V *=i 

E g>(E) 

where g'{a ) = (¥) l=a ’ 9(x) = ln(x), and g'(E) = 1/E. 



(29) 
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Fischer score standard error, s *, is obtained from the inverse of the 
observed Fischer score matrix (p. 129 of [17]). The Fischer score CV of 
estimates: 

-E# (30) 

n Ei 



i= 1 



The percent difference from reference of the estimate: 

.Ei-Ki 



qi = 100 - 



I Ki 



The percent bias: 



Q = - £ * 
n r~f 



t=i 



The standard deviation of percent bias: 






The confidence index: 

100 



n 



1 ^I(2S i -\K i -E i \>0) 



(31) 



( 32 ) 



(33) 



( 34 ) 



i=i 



where 1(a) = 1 for a true and 1(a) = 0 for a false. 



APPENDIX A5 

Equations for the Two Compartment PK/Sigmoidal 
Emax Model 



The equations to the PK/PD model described in Fig. 4 are as follows: 



C(t) = 


Ae~ at + Be- 01 


( 35 ) 


C e (t ) = 


^f(E l + E 2 + E 3 ) 


( 36 ) 


E(t) = 


E m *xC e (ty 
CSV + CSo 


( 37 ) 


a = 


(k\2 + k 2 \ + kio) + y/ (ki 2 + k 2 \ + fcio) 2 — 4 k 2 iho 


( 38 ) 


2 


a = 


(k \ 2 + k 2 \ + kio) - yj (k\ 2 + k 2 i + ho) 2 - 4^21^10 


( 39 ) 


2 
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A = 


D a — hi 
Vc a — (3 


(40) 


B = 


D — Ar 2 i 
Vc a — 0 


(41) 


Ei = 


hi - a c _ at 


(42) 


(0 - a) (k eo ~ <*) 


i?2 = 


hi - 0 _ 0t 


(43) 


(a - 0)(k to - 0) 


£3 = 


hi - k eo -keot 


(44) 


(0 — keo)(0 ~ keo) 



In the above equations, D = Dose, V c , Aqo, A72, and h 1 are PK pa- 
rameters to a two-compartment model; k eo is the link effect parameter; 
Emax,C e 50 and 7 are PD parameters to the E max model; C(t) is the 
concentration of the drug in plasma; C e (t ) is the concentration of drug 
in the effect compartment; E(t ) is the effect as it changes with time. 



APPENDIX A6 

Modeling Inter-Occasion Variability 

A method for modeling inter-occasion variability for EM methods is 
described for a simple example. Let clearance CL be modeled to have 
inter-subject and inter-occasion variability, and there are at most m 
occasions for any subject in the data set. Create m additional popu- 
lation parameters, say CL 1 , CL2, ... CLm , constraining the population 
mean of each of these additional parameters to 0, and having them share 
a common but unknown inter-occasion variance Cl{ n tero , with the co- 
variance between the inter-occasion population parameters constrained 
to 0 as well. Population parameters in an EM algorithm can be con- 
strained by forcing the population parameters in question to a certain 
value after each maximization step. The Monte-Carlo algorithm gen- 
erates random samples of CL with inter-subject variability VC Liters, 
and CL1 , CL2,... CLm with inter-occasion variability VCLintero- The 
PK/PD model then uses these values to obtain clearances CL + CLl for 
occasion 1, CL+CL2 for occasion 2, etc. In this manner, different clear- 
ance values are randomly generated for each occasion, with variability 
VCLi n t ero between occasions within a subject, and variability VCLinters 
between subjects. We have verified this method by simulating data with 
inter-occasion variability and analyzing with MC-PEM. 
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INTRODUCTION 

For a linear system, the system input-output relationship can be de- 
scribed by the convolution integral 

t 

y(t) = J I(s)K(t - s)ds (1) 

o 

where y{t ) is the output, I (t ) is the input function and K(t) is the system 
disposition function (i.e., unit impulse response). In a typical bioavail- 
ability study, the system disposition function can be obtained from an 
intravenous (IV) administration, and the desired input function corre- 
sponding to a formulation or route of administration (e.g., oral (OR) 
or topical) is obtained following an oral administration conditional on 
the estimated disposition function (another case of joint estimation of 
system disposition function and input function will be discussed later). 
Typically one can use a parametric model for K(t), for example a mul- 
tiexponential function. To represent the input function I(t), one can 
use a nonparametric function, for example a spline [1], There are two 
main problems with using nonparametric representations in deconvolu- 
tion. First, constraints need to be imposed (for example, to guarantee 
positivity of the estimated input function) which make it a lm ost im- 
possible to derive meaningful second order statistics, e.g., confidence 
bands for the estimated function. Second, in order to use a spline, the 
number and location of breakpoints has to be determined. Previous ap- 
proaches to solve this model selection problem involved positioning the 
breakpoints at equispaced or quantile locations, combined with a model 
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selection criterion, or the use of a smoothing spline. These methods rely 
on ad hoc positioning of the breakpoints, which might have no relation- 
ship with the shape of the underlying input function [2] (If the location 
of the breakpoints is allowed to vary freely, the model selection prob- 
lem of selecting splines with different locations of breakpoints becomes 
quickly unmanageable even if the number of candidate locations is kept 
relatively small.) 

We consider a Reversible-Jump Metropolis-Hastings (RJMH) algo- 
rithm as a possible solution to these problems. The RJMH algorithm 
allows the exploration of a very large collection of splines corresponding 
to alternative positions of breakpoints, it solves the problem of select- 
ing a breakpoint position and obtains the posterior distribution of the 
corresponding (input function) estimate. 

The second inverse problem we consider is the application of Bayesian 
estimation to the problem of unbalanced experimental designs in bioavail- 
ability studies. By unbalanced design we mean those situations in which, 
due to experimental limitations, few or no observations following an IV 
administration are collected. The estimate of the disposition function 
is therefore either imprecise or simply not available and the desired in- 
put functions cannot be obtained using standard methods. We solve 
the problem adopting a Bayesian approach: we specify a prior disposi- 
tion (and input) function and use Markov chain Monte Carlo methods 
(MCMC) to simultaneously obtain the disposition and input function 
from (e.g., OR data ). 

This paper is organized as follows: we (i) describe the RJMH algo- 
rithm, (ii) describe the simulation study used to test the new methodol- 
ogy, (iii) show the results obtained in the simulation study, (iv) describe 
the approach used to solve the unbalanced design problem, and (v) show 
results of a study application to veralipride data. A discussion ends the 
paper. 



RANDOM JUMP METROPOLIS-HASTINGS 

RJMH [3,4] belongs to the family of Markov chains with jumps. In 
general, given a collection of alternative models, the transition (jump) 
between models is constructed in two stages: a proposal transition be- 
tween models and an acceptance probability for the transition. Many 
models can be considered simultaneously, as it is needed in our case, 
and a large number of qualitatively different moves are entertained by 
the random search. The competing models are given prior probabilities 
and a Markov chain Monte Carlo algorithm generates samples from the 
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posterior distribution of the models that can be used for model selection 
purposes. In our application it solves the problem of choosing the loca- 
tion of the breakpoints and has the potential to be particularly useful 
for the recovery of irregular (e.g., multiple-peaked, long-delayed, etc.) 
input functions. Employing a RJMH algorithm simultaneously solves 
the problem of obtaining meaningful high order statistics by obtaining 
the distribution of the estimates, thus solving one the main problems of 
constrained nonparametric estimation. Fig. 1 shows the schematic flow 
chart of the algorithm. There are three major proposal steps: birth, 
death or move. In birth step, a new breakpoint is added. In death step, 
one of the existing breakpoints is removed. In move step, one of the ex- 
isting breakpoints is shifted by a prespecified amount. Corresponding to 
a proposed step, the likelihood is computed, and a Metropolis-Hastings 
type test is performed to determine whether or not to accept the new 
breakpoints after the birth, death or move step. In MCMC terminology, 
each proposal/acceptance contributes one sample, breakpoints in this 
case, of the Markov chain. 

Since there are many input function models of differing dimension, 
and we are assuming that there is no prior information about these mod- 
els, the prior probabilities of the models were specified using a Poisson 
distribution. The Poisson distribution has one argument, which is also 
the mean and variance of the distribution. Therefore, it down-weights 
the model that is away from the mean value. A conservative value for 
the mean of the distribution can be adequate (for example, a possible 
default is 2 plus the square root of the total number of observations), 
because there still remains enough flexibility for transition between dif- 
ferent dimensions and selecting a posterior model with higher number 
of breakpoints (e.g., see Denison et al. [4] and Kang and Verotta [5] for 
further discussion). If previous data on the number of breakpoints are 
available, an alternative is to use the posterior mean value from the anal- 
ysis of the same or similar data (see below the discussion on the best 
model a posteriori). Given the prior probabilities of the models, the 
RJMH performs a birth, death or move step at each iteration. The iter- 
ation continues until the mean squared error (MSE) between the model 
prediction and data attains a specified precision. A detailed description 
of the algorithm can be found in Denison et al. [4] and Kang and Verotta 

[5]. 
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Delete a breakpoint Move a breakpoint 
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Compute a j+ „ L j+1 ,MSE j+1 
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Reject 


Accept death Accept move 
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Fig. 1. Flow chart of the Reversible- Jump Metropolis- Hastings algorithm. £, 
breakpoints; <*i+i , coefficients in the spline representation at the i+lst iteration; Lj+i, 
likelihood at the i+lst iteration; MSEi+i , mean squared error at the i+lst iteration; 
Pp, a random number generated from U(0,1); Pb, probability of going to birth step; 
Pd, probability of going to death step; N, number of candidate breakpoint locations; 
Z, number of candidate breakpoint locations where a new breakpoint cannot be put 
(see also text). 



SIMULATION STUDY 

To evaluate the performance of the RJMH method with respect to 
an alternative state-of-the-art method of determining the position of the 
breakpoints, we performed a Monte Carlo simulation study. We used 
3 test input functions of differing complexity and different number of 
measurements. They are shown in Figs. 2-4. Simulation study 1 used 
a hypothetical input that has positive input followed by negative input. 
The negative input corresponds to a case for which an absorbing sub- 
stance such as charcoal is used to extract some amount of the drug in 
the body. The sampling schedule for this input corresponds to a rather 
typical bioavailability experiment. Simulation study 2 and 3 used input 
functions with higher degree of complexity and use more intensive sam- 
pling. In all cases, a monoexponential disposition function was assumed 
((1/50) • e -0144 t ) for simulation study 1 (Fig. 2), 1 • e _0 693 t for simu- 
lation study 2 (Fig. 3), and 1 . e - 2 - 772t fo r Simulations Study 3 (Fig. 4). 
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We simulated the observations corresponding to each input by solving 
the convolution integral (Eq. (1)) numerically using the NONMEM [6] 
program, and then added measurement error to obtain the simulated 
measurement data by sampling from a normal distribution with mean 
zero and variance 0.01 2 . The number of measurements was 24, 200 and 
200 for simulation study 1, 2 and 3, respectively. For each simulated data 
set, the RJMH algorithm is used to obtain the posterior distribution of 
the input function assuming the true disposition function. As a starting 
point for the RJMH search, we assumed 5 breakpoints located at the 
quantiles of the observations. Then RJMH algorithm was continued un- 
til 3 significant digits in the MSE between predictions and observations 
were obtained. 

QUANTILE BEST MODEL POSTERIOR MEAN 




Fig. 2. Simulation study 1. Examples of the results of deconvolution using the test 
input function 1 (positive input followed by negative input). Leftmost panels: results 
of QUANTILE method, right panels: results of RJMH. In the upper panels, the solid 
line is the true concentration, the open circles are measurements obtained by adding 
additive noise of N(0, 0.01 2 ), and the filled diamonds are the model predictions based 
on the selected spline model. In the lower panels, the solid line is the true input profile 
that was used to obtain simulated output in the upper panels, the filled diamonds are 
estimated input after deconvolution, and the triangles are the breakpoint locations. 

To summarize the posterior distribution obtained by the RJMH al- 
gorithm one can use the best model a posteriori (best model in short 
hereafter), which is the most probable model under the current data. As 
an alternative, one can consider a collection of models with the highest 
posterior probabilities above a certain threshold, or the whole collection 
of models and the corresponding posterior mean input and other func- 
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QUANTILE 



BEST MODEL POSTERIOR MEAN 




Fig. 3, Simulation study 2. Examples of the results of deconvolution using the 
test input function 2 (increasing input rate with a bump in the middle). In the upper 
panels, solid line (hidden under the dots) is the true output, open circles (also hidden 
under the dots) are the simulated measurements, and the filled diamonds are the 
model prediction based on the selected spline model. In the bottom panels, the solid 
line (hidden under the dots) is the true input profile that was used to obtain the 
simulated output of the upper panels, the filled diamonds are the estimated input 
after deconvolution, and the triangles are the breakpoint locations. 



tion predictions. For simplicity, we consider the best model a posteriori 
and the posterior mean to compare with other methods. 

We compare the performance of the RJMH algorithm with a stan- 
dard method, see, e.g., Verotta [2], which, for any number of break- 
points, positions of the breakpoints at the quantiles of the observations 
measurement times, fits the resulting model to the data using a con- 
strained minimization method (QPROG [7]), and chooses the number 
of breakpoints based on the statistical model selection criterion, e.g., the 
Hannan-Quinn criterion [8]: 

HQ = n log ^ ^ - y(*i)) 2 ^ + 2 log (log(n)) • p 

where HU) is the true value, y{U) is the model prediction, n is the 
number of observation, and p is the number of parameters. We refer to 
this method as the QUANTILE method. We did 100 simulation studies 
for each test input function to compare the RJMH and QUANTILE 
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POSTERIOR MEAN 




Fig. 4. Simulation study 3. Examples of the results of deconvolution using the test 
input function 3 (complex input rate). In the upper panels, solid line (hidden under 
the dots) is the true output, open circles (hidden under the dots) are the simulated 
measurements, and the filled diamonds are the model prediction based on the selected 
spline model. In the bottom panels, the solid line (also hidden under the dots) is the 
true input profile that was used to obtain the simulated output of the upper panels, 
the filled diamonds are the estimated input after deconvolution, and the triangles are 
the breakpoint locations. 



methods. To this end, we computed the average number of breakpoints 
and the following quantities: 

NMSE (Normalized Mean Squared Error), 

E(f(U)-y(U )) 2 

NMSE = l -~ 

E (Hu) - I ) 2 

i= 1 

NMSB (Normalized Mean Squared Bias), 

e mu) - mf 

NMSB = ^ 

E (f -HU )) 2 

i=l 



In the above equations, / is the mean of the true values, y(U) is the 
mean model estimate over the total number of simulations. 
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RESULTS 

One example from simulation study 1 is used to show the posterior 
distribution of the number of breakpoints and is illustrated in Table 1. 
For this particular simulated data set, 3 to 1 1 breakpoints were selected 
during the iterations of RJMH (firsts column of Table 1). The second 
column of Table 1 shows the probability that as many breakpoints as 
the number in the first column was selected. Among the same number 
of breakpoints, the third column shows the index of the one position of 
breakpoints that has the smallest mean squared error. Therefore, the 
best model a posteriori for this example is the one designated by index 
384 among all the models with 8 breakpoints. 



Table 1. The best model a posteriori and the associated probability. 



Nxi“ 


P(Nxi) 6 


Index(Best Model) c 


3 


0.00205 


20 


4 


0.0137 


117 


5 


0.0665 


166 


6 


0.164 


285 


7 


0.251 


430 


8 


0.268 


384 


9 


0.165 


280 


10 


0.0593 


371 


11 


0.0106 


69 



“Number of breakpoints. 

fc The probability that the corresponding number of breakpoints 
was selected during the RJMH iteration. 

“Index of the best model a posteriori, which has the smallest 
mean squared error among the same breakpoint number models. 



Table 2 shows the comparison statistics with respect to output or in- 
put together with the mean number of breakpoints. Underlined quanti- 
ties are the smallest values obtained among different methods. In most 
of the cases, the smallest value came from either the best model or 
the posterior mean. When the smallest value is from the QUANTILE 
method, the difference is minimal. Overall, better or comparable results 
were obtained with the RJMH with respect to the QUANTILE method, 
and almost always a significantly smaller number of breakpoints were 
necessary to obtain similar or better NMSE and NMSB. The computing 
time required for running the RJMH algorithm was about two times 
longer than the quantile method. For example, the computation time 
was 1 h 40 m vs. 3 h 30 m for simulation study 1 and 17 h vs. 31 h for 
simulation study 3 using Sun Spare 5 330 Mhz. 
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Table 2. Comparison of quantile breakpoints and RJMH for deconvolution using 
the test input function. 



Simulation 1 


NMSE“ 

output 


NMSE 6 

input 


NMSB“ 

output 


NMSB'' 

input 


Nxi* 

avg 


Quantile 


6.95E-3 


0.210 


328E-5 


305 


12 


Best Model 


3.41E-3 


0.128 


40E-5 


185 


6 


Posterior Mean 


3.18E-3 


0.126 


80.7E-5 


185 


6 



Simulation 2 



Quantile 


3.16E-4 


2.54E-3 


3.12E-4 


2.10E-3 


U 


Best Model 


3.16E-4 


3.59E-3 


3.10E-4 


1.41E-3 


11.1 


Posterior Mean 


3.14E-4 


2.29E-3 


3.10E-4 


1.43E-3 


11.3 



Simulation 3 



Quantile 


1.69E-4 


2.47E-3 


47E-6 


14.3E-3 


12.8 


Best Model 


2.14E-4 


5.68E-3 


3.05E-6 


1.31E-3 


12.3 


Posterior Mean 


1.52E-4 


2.56E-3 


4.09E-6 


1.78E-3 


13.1 



“Normalized mean squared error for the predicted output. 
h Normalized mean squared error for the estimated input. 
“Normalized mean squared bias for the predicted output. 
^Normalized mean squared bias for the estimated input. 
“Average number of breakpoints. 



UNBALANCED EXPERIMENTAL DESIGN 

The joint estimation of input and disposition functions given (i) IV 
and OR observations, (ii) prior information on the probability distribu- 
tion of the disposition function, and (iii) the prior probabilities and as- 
sociated distribution of a collection of candidate input functions, poses 
in principle no additional problems. The only modification to the al- 
gorithm is that the disposition function is not fixed at the estimated 
values as in the sequential analysis, but instead its prior distribution is 
taken into consideration when obtaining the posterior distribution us- 
ing MCMC. However, an interesting Bayesian estimation problem arises 
when the experimental design is unbalanced, and fewer or no observa- 
tions following an IV administration are collected. This can happen due 
to cost considerations or experimental impossibility (when an IV formu- 
lation of the drug does not exist), or when observations are collected 
in two separate groups of people, one receiving the IV and the other 
the OR administration. The problem has received much attention in 
the pharmacokinetic literature (e.g., [9] and [10], and similar studies re- 
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Table 3. Estimates of system disposition functions (of the form 
K (t) = A ■ e~ K ' 1 + B ■ e~ Ki t ) for the three subjects of the veralipride 
model. 





A 


K\ 


B 


k 2 


SBJ1 


6.66 


0.306 


40.6 


7.55 


SBJ2 


5.88 


0.199 


51.9 


8.41 


SBJ3 


4.36 


0.173 


32.2 


5.26 


Mean 


5.63 


0.226 


41.6 


7.07 



ported afterwards) but no statistically sound methodology has been put 
forward to solve the problem. 

To investigate the solution of this problem using Bayesian MCMC 
methods we used a set of real veralipride data [11] in which the same 
group of individuals received both IV and OR dose. The number of ob- 
servations following the IV administration is artificially pruned to zero 
to compare the estimation results for the corresponding cases. The prior 
information about the biexponential system disposition function was ob- 
tained from a preanalysis of three subjects and is shown in Table 3. The 
prior mean for the three subjects was used in the Bayesian estimation 
and, due to the limited number of available subjects to obtain variance 
information, the prior standard deviation was set to be 50% CV with 
respect to the mean of each parameter. 



RESULTS: VERALIPRIDE DATA 

Fig. 5 shows results obtained by MCMC using the veralipride data. 
The left panels show the OR concentration data and the right panels 
show the IV concentration data. From the top right panel to the bottom 
right panel, the number of IV data is pruned from the original 19, to 5 
and 0. The crosses denote the deleted measurements. The dotted lines in 
the figure are the 95% highest posterior density (HPD) intervals for the 
predicted quantities. The departure between the predictions obtained 
using 19 observations and the cases using 5 or 0 increases when the 
number of IV observation decreases. In other words, the amount of 
uncertainty in the predicted IV concentration increases as the amount 
of IV data decreases. However the predictions and the HPD intervals 
associated with the oral observations are changed very little when the 
number of IV observations decreases. To evaluate the robustness of the 
method, we performed a small sensitivity analysis study in which the 
prior standard deviation was specified as 25%, 50% or 100% CV with 
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respect to the mean, and the estimated posterior mean input function 
was compared with the estimate obtained using 19 IV data (the best 
possible estimate). As expected when the prior variance is larger, the 
departure from the best possible estimate increases, a significant bias is 
however only introduced for the 100% CV case. 



ORAL CONC 



IV CONC 





Fig. 5. Result of joint estimation using veralipride data. In the left panels, open 
circles are measurement data after oral administration, solid lines are the model pre- 
dictions. In the right panels, open circles are measurement data after IV administra- 
tion, crosses are deleted data, solid lines are the model predictions. The number of 
IV data is 19, 5 and 0, from top right panel to bottom right panel. The dotted lines 
are 95 % HPD (Highest Posterior Density) intervals. 
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DISCUSSION 

A RJMH algorithm was successfully applied to the deconvolution 
problem and its performance was demonstrated with several examples. 
The method is especially useful when the function to be retrieved is 
highly irregular (i.e., double-peaked, triple-peaked, long-delayed, etc.) 
because of its ability of adapting to the data by putting more break- 
points wherever they are necessary. As usual with MCMC methods, the 
RJMH required longer computation time than the simpler QUANTILE 
method. Moreover, for retaining posterior information on the break- 
point locations a rather large storage space is needed depending on the 
complexity of the problem. However both computer speed and storage 
are becoming less of a problem as computer technology develops. 

One reason for the large computing time is also due to the exceed- 
ingly large number of alternative input function models that need to 
be considered when there is no prior information on the input function. 
When the prior information on the number and position of the break- 
points is more informative the computing times required by RJMH drops 
dramatically. 

We also showed results of the preliminary investigations on the use 
of prior information in the estimation of disposition and input functions 
in unbalanced bioavailability studies. By introducing a Bayesian prior 
on the disposition function parameters and input function parameters, 
it was shown that the unknown input function following an oral dose 
might be retrieved even when no IV observations were available. A 
sensitivity analysis study confirmed that the method is quite robust. 
The results of this study are encouraging, suggesting that a Bayesian 
MCMC method has the potential to solve an outstanding problem in 
bioavailability studies. Further investigation is necessary to study the 
limitations of the approach. Even if the proposed method will prove 
successful only under stringent conditions (for example, only when the 
prior variance is small), the main advantage over existing methods [9,10] 
is that it allows a precise quantification of the precision of the desired 
estimates, which can be used to judge objectively the reliability of the 
conclusions obtained from an unbalanced bioavailability study design. 

The RJMH algorithm was written in Fortran 77 and the program can 
run stand-alone. Interested readers should contact the author (dong- 
woo.kang@alza.com) to obtain a copy. 
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INTRODUCTION 

One of the more important applications of mathematical modeling in 
pharmacokinetics and pharmacodynamics involves its use in estimating 
those properties of a system that are not amenable to direct measure- 
ment. In pharmacokinetics/pharmacodynamics (PK/PD), this problem 
of model parameter estimation has historically been solved using non- 
linear least squares estimation. In PK/PD modeling studies in which 
measured data are sparse and prior information is available about model 
parameters, Bayesian inference is applicable. Early work in this area in- 
cludes the report of Sheiner and his colleagues [1] who estimated digoxin 
clearance as the mode of its posterior distribution given measurements 
of digoxin plasma concentration in a subject and a known prior distri- 
bution for clearance in the population (maximum a posteriori estimate, 
MAP). Katz, Azen and Schumitzky [2,3] reported the first efforts to 
calculate the complete posterior distribution of model parameters in a 
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nonlinear pharmacokinetic estimation problem, for which they used nu- 
merical quadrature methods to perform the needed multi-dimensional 
integrations. Following this work, others reported the use of a variety of 
numerical quadrature, analytical and other approximation techniques for 
computations involving posterior densities (e.g., [4—7]). These methods, 
however, were found to lack the robustness needed for general nonlinear 
modeling applications, including those arising in PK/PD. 

More recently, sampling-based methods (e.g., [8-11]), and in partic- 
ular Markov chain Monte Carlo (MCMC) methods (e.g., [12-17]), have 
been developed that provide a viable computational approach for com- 
plex Bayesian inference problems in a wide range of application areas. 
These methods have been applied to nonlinear Bayesian inference prob- 
lems in PK/PD involving population (hierarchical) models (e.g., [18-23], 
see also the chapters by Rosner et al. and Wu et al. in this volume) and 
individual models (e.g., [24-28]). 

In this chapter we explore the implementation of the regenerative 
MCMC methods proposed by Mykland, Tierney and Yu [29], and their 
application to Bayesian inference involving individual PK/PD modeling 
problems. The major motivation in pursuing regenerative methods is 
to provide a convenient method for estimating standard errors of Monte 
Carlo estimates, in order to allow inferences about PK/PD properties 
for individual subjects. The chapter is organized as follows. The next 
section provides a general modeling and individual Bayesian estimation 
framework for PK/PD models. Our implementation of the regenerative 
MCMC hybrid algorithm proposed by Mykland, Tierney and Yu [29] 
is then summarized. The subsequent section describes the application 
of a technique [29] for determining the regeneration times in a Markov 
Chain produced by a basic MCMC sampler, thus leading to identification 
of independent, identically distributed chains (chain splitting). Next 
the methods used for analyzing regenerative processes are presented. 
The two regenerative approaches are then applied to a pharmacokinetic 
estimation problem and the results evaluated through simulation studies. 



INDIVIDUAL ESTIMATION IN PD/PD USING 
BAYESIAN INFERENCE 

We assume that the PK/PD of the drug under study can be modeled 
as a continuous dynamic system with discrete time measurements of the 
drug’s kinetic and/or dynamic responses, as given below: 
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= /(x(f),Q!,r(f),f), *(0) = c 

z{tj) = + e(tj), j * 1, .... m (1) 

where the state vector x(t) (nxl) can represent compartment amounts, 
tissue concentrations, cell concentrations, effect site concentrations, phys- 
iological variables, ligand concentrations, signaling pathway proteins, 
etc. In Eq. (1), the vector r(t) (kx 1) represents drug infusion regi- 
mens (allowing for administration and modeling of multiple drugs) as 
well as any patient- specific covariates (e.g., cli ni cal variables affecting 
PK/PD processes). In addition, drugs can be administered as bolus in- 
puts, represented by the vector 6(f) (nxl), that are assumed to change 
states instantaneously (i.e., x(dt,+) = x(dt~) + 6(df,), i = 1,...,). The 
vector a (pxl) represents the collection of all model parameters (rate 
constants, distribution volumes, partition coefficients, effect site clear- 
ances, maximum responses, biomarker production and turn over rates, 
etc.), and is random with known probability density function p(a). Mea- 
surements of the drug’s kinetic and/or dynamic responses are repre- 
sented by the lx 1 vectors z(tj), j = 1, . . . , m. It is assumed that these 
measurements are subject to random errors that are independent with 
zero mean. For notational convenience the vector z (mix 1) is used to 
represent the measurements ( z = [z(t\) T , , z(t m ) T ] T ), e (mix 1) de- 
notes the errors (e = [e(fi) T , ...,e(f m ) r ] r ), and the vector y(a,t ) rep- 
resents the model outputs h(x(t),a,r(t),t). It is further assumed that 
Var(ei(tj)) = gi(y(a, tj)), j = 1, . . . , m, * = 1, . . . , l, where the functions 
gi are known. 

Given this pharmacokinetic/pharmacodynamic model formulation, 
Bayes’ theorem is applied to estimate the posterior distribution of the 
model parameters p(a\z) given the measurements z and the prior distri- 
bution p(z): p(ct\z) = l(a\z)p(a)/ J l(a\z)p(a)da. The likelihood func- 
tion l(a\z) is readily obtained given the model definition above and an 
assumption about the error distribution. We are interested in making 
inferences about features of the posterior density of the model parame- 
ters p(a\z) (e.g., means, quantiles), as well as features of model response 
density p(j/(a,f)|z) and other functions of model parameters. As noted 
above, these quantities can be estimated using MCMC methods. 
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REGENERATION VIA HYBRID MCMC 
ALGORITHM 

As proposed in [29], regeneration can be introduced into an MCMC 
sampler using a hybrid approach in which a basic sampler is embedded 
within a rejection sampling algorithm. Following [30], the algorithm 
described below uses the independence Metropolis-Hastings (IMH) al- 
gorithm as the basic sampler component of the regenerative sampler 
(RIMH sampler). 

The RIMH algorithm first initiates the chain in some defined param- 
eter set (steps 1 and 2 in the algorithm), and the times at which the 
chain returns to the set are the regeneration times (step 5). In the al- 
gorithm below, step 4 is the rejection sampling stage and step 6 is the 
IMH acceptance criterion. 

1. select c : 0 < c <max l(a\z ) ; define setC = {c* : l(a\z) < c} 

2. select current state of chain a* 6 C 

3. for n — 1, . . repeat 4-6 

4. sample a from p(a) and ul from (7(0,1) until u\ < l(ctj\z)/c, 
when currently at a t = a n 

5. for a e C, then a n+ i — aj 

6. sample u2 from (7(0,1) 

i) for aii g C and aj € C, 

if u2 < c/l(ai\z), then a n+ i = aj, elsea n+1 = aj 

ii) for a t # C and aj C, 

if u2 < l(ai\z)/l(ai\z), then a n+ 1 = aj,else a n+ i = a* 

This algorithm produces the chain {aj, . . . ,awi,<*;v 1 +i> . . . , ajv 2 > • • • , 
a N „}, where Ni, N 2 , . . . , N n are random regeneration times defining in- 
dependent, identically distributed subchains. Methods for analysis of 
this regenerative process as well as algorithm implementation issues, in- 
cluding selection of the regeneration constant c, are discussed below. 



REGENERATION BY SPLITTING A MARKOV 
CHAIN 

The second approach involves identifying regeneration times in a 
Markov chain produced by a basic sampler, and is based on the ideas of 
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Nummelin [31] for splitting a Markov chain. The algorithm described 
below splits the chain produced by the IMH sampler as proposed in [29] 
(SIMH sampler). Steps 1-4 are the basic IMH algorithm, while steps 5-8 
split the IMH chain following Nummelin’ s method. 

1. initialize chain at cto 

2. for n— 1 ,N, repeat 3 & 4 

3. sample atj from p(a), when currently at ot{ — a n 

4. sample ul from 1/(0, 1) 

if ul < l{atj, z)jl{oti\z)y then a n+ i = c*j, else ctn-t-i = 

5. select splitting constant c > 0; set k — 1 

6. for i = 1, . . . , N, repeat steps 7 & 8 

7. while ctj / c*t+i, sample u2 from t/(0, 1) 

8. implement probabilities for regeneration 

i) if l(ai\z)>c, ((cq +1 |z)>c and u2<max{c/((ai|z), c//(aj + i|z)}, 

accept Nfc = i as next regeneration step 

ii) if l(ai\z)<c, l(a(+i\z)<c andu2<max{c//(aj|z), c//(aj + j|z)}, 

accept Nfc = i as next regeneration step 

iii) else 

accept Nfr — i as next regeneration step 

9. increment k 

With the identification of the regeneration times N\, IV 2 , . . . , N n , the 
chain generated by the IMH sampler can be then written as {qi , . . . , ajVj , 
ajVj+i, . . . ,aAr 2 , • • . ,ocn„}, • Selection of the splitting constant c in step 
5 as well as other implementation issues are discussed below. 

ANALYSIS OF REGENERATIVE PROCESSES 
AND IMPLEMENT AION OF RIMH AND SIMH 
ALGORITHMS 

The chains produced by the RIMH and SIMH algorithms can be 
used to estimate quantities of interest for the model parameters, such 
as means, standard deviations, credibility intervals and quantiles. To 
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obtain confidence intervals for these quantities, we apply the methods 
introduced by Crane and Iglehart [32] for analyzing a regenerative pro- 
cess. We outline below calculation of confidence intervals for means, and 
quantiles of model parameters, as well as for functions of the parameters 
such as model outputs. 

Sample Means 

Let {Xi, X %, . . .} represent the chain of interest (e.g., model param- 
eters, model outputs) and {N\, N %, . . . , N n ] the increasing sequence of 
random stopping times that divide the process into a collection of iid 
subprocesses. We define the length and total observation of each sub- 
chain as follows: 



Vk = Xk~ N k - 1 


(2) 


Nk 




% 

w 

II 


(3) 



i=N k - i+I 



were {(v k ,Yk : k) < 1} are of iid random vectors [33]. Letting v and 
Y denote the respective means of the sequences, then E[X] can be 
approximated by the classical ratio estimator as Y/v (n is the number 
of subchains or tours of the process). We have also applied and tested 
other ratio estimators (e.g., Beale, Fieller, Jackknife, and Tin [33]) as 
described in [34] and have adopted the classic ratio estimator for our 
application based on its overall performance on representative problems 
and ease of implementation. We discuss below the effect of the number of 
subchains n on the accuracy of the central limit theorem approximation 
in the classic ration estimator. 

The asymptotic 95% confidence interval for the classical ratio estimate 
of can be calculated as follows [32]: 



were 




(4) 

(5) 



Sample Quantiles 

The approach described above can be used to estimate other moments 
of a regenerative process {Xi, X 2 , . . .}, or in general any real-valued 
function f(X) (replace X x by f(X t ) in Eq. (3)). We are interested in 
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estimating the pth quantile of the distribution of X, F(X) and its con- 
fidence interval. Letting F n (x) denote the empirical distribution of X 
based on n tours of the regenerative process, then the sample pth quan- 
tile of F(X) is Q{p) =inf{x:F n (a;) > p}. For a discrete time regenerative 
process with continuous state space (as in the PK/PD application de- 
fined above), the sample quantile based on n subchains is calculated as 
follows: 



Qnip) = ([A n p] + l)-st order statistic, N n p is not integer 

= {N n p)~ th order statistic, N n p is integer 

where [x] is the greatest integer less than or equal to x . This is the 
empirical distribution method for quantile calculations [35], which can be 
computationally expensive in that it involves sorting the entire chain (see 
[34] for discussion and evaluation of other approaches). The confidence 
interval for the Q(p) is then calculated as per the presentation given 
above for the sample mean, using an indicator function f(x) = l^X), 
where A = {x : x < x 0 } (in Eq. (3) replace X t by l^(AT)). 

Implementation of RIMH and SIMH Algorithms and 
Analysis 

The two regenerative sampling algorithms were implemented in the 
ADAPT software package for PK/PD systems analysis, for the general 
model formulation given in Eqs. (1,2). In addition, the regenerative 
process analysis methods described above were incorporated to provide 
estimates and confidence intervals for the mean, quantiles and other 
moments of model parameters and responses. The general implementa- 
tion of the RIMH and SIMH algorithms in ADAPT allowed testing and 
evaluation of the regenerative methods on a variety of PK/PD models 
and provided an opportunity to study important implementation issues. 
In the example to follow, all computations were performed on a Sun 
workstation (Ultra Spare 10). 



EXAMPLE: TENIPOSIDE 
PHARMACOKINETICS 

Population PK Model of Teniposide 

The antineoplastic drug teniposide has been used as a single agent and 
in combination with other drugs in the treatment of children with acute 
lymphocytic leukemia, lymphomas and neroblastoma (e.g., [36-37]). An 
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exposure/response relationship has been demonstrated between area un- 
der the teniposide blood concentration-time curve (AUC) and reduction 
(>75%) in leukemic blast cell count, as well as between AUC and toxic- 
ity (moderate to severe mucositis) [36]. A population pharmacokinetic 
model has been used successfully to adaptively control the AUC of teni- 
poside in individual patients as part of a Phase I study [38]. In an effort 
to incorporate pharmacokinetic uncertainty into dose regimen design for 
teniposide, a stochastic control approach for targeting the AUC of teni- 
poside was developed and evaluated previously [39] for which discrete 
density approximation [40] was used to calculate the required posterior 
distributions of the model parameters. In what follows, this example is 
used to evaluate the application of the RIMH and SIMH algorithms. 

A two compartment linear model was used to model the plasma con- 
centration of teniposide following an intravenous infusion of the drug. 
Based on the results of a population study of teniposide’ s pharmacoki- 
netics in children with cancer, the model parameters (K pc ,K cp , V, CL) 
were determined to be multivariate log normally distributed with mean 
(n) and covariance (fl) given in Table 1. 



Table 1. Mean and covariance for the population parameters 



Parameter 


Mean 




Covariance 










Kpc 


Kcp 


V 


CL 


Kpc 


0.189 (hr" 1 ) 


0.0112 








Kcp 


0.286 (hr -1 ) 


0.0024 


0.0081 






V 


2.04 (L/nd) 


-0.0247 


-0.0116 


0.163 




CL 


0.652 (L/m7hr) 


-0.00030 


-0.136 


0.0697 


0.0778 



Simulated Sample Subjects 

Simulated teniposide concentrations were generated from three sub- 
jects selected to represent individuals with low, median and high values 
of AUC in the population. The model was simulated using samples from 
the population distribution (drug infusion rate of 60 mg/nr/hr over four 
hours), and used to identify representative parameters corresponding to 
the 10 th , 50 tft and 90 i/l percentiles of the 24 hr AUC in the population. 
The values for these three sets of model parameters are given in Table 2, 
along with the corresponding values of the 24 hr AUCs denoted AUCl, 
AUCaz, AUC#. Random error ( Var(e(t )) = (0.1y(o;, t) + 1.0) 2 , see prob- 
lem definition above) was added to the teniposide concentrations at 1.5, 
3.5, 8 and 24 hr predicted by each of these three parameter sets and 
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used as the simulated data in the evaluation of the RIMH and SIMH 
algorithms. 



Table 2. Parameter values and for the three selected subjects from the Teniposide 
population representing the low (AUCl), medium (AUCaj) and high (AUC„) AUC 
groups. 



Parameter 


AUC,. 


AUCa, 


AUCh 


Kpc (hr -1 ) 


0.106 


0.0908 


0.192 


Kcp (hr -1 ) 


0.143 


0.372 


0.644 


V (L/nd) 


2.29 


2.38 


1.45 


CL (L/m'Vhr) 


1.02 


0.539 


0.235 



Results using RIMH Algorithm and Coverage Study 

Selected moments and other quantiles from the estimated posterior 
distributions of the parameters are presented in Table 3 for each of the 
three subjects. Figure 1 displays the marginal posterior densities of 
each model parameter and that of AUC(0,24) for the two subjects with 
low and high values of AUC. Table 4 summarizes the characteristics 
of the regenerative chain including the number and average length of 
the subchains, as well as the value of the regeneration constant c (ex- 
pressed as a percent of maxl(a)2)) used to define the regeneration set. 
For AUC(0,24) and the parameters V and CL, Table 5 lists the confi- 
dence intervals calculated from the regenerative analysis for the mean 
and selected quantiles of the posterior density. 

In the RIMH algorithm, selection of the regeneration constant c is 
bounded by the maximum value of the likelihood. Our implementation 
uses randomly drawn samples cq from the prior parameter density (5000 
in the Teniposide example) to approximate the maximum value of the 
likelihood function (max(/(a t |z),i = 1,...)). The particular value then 
chosen for the regeneration constant c determines the size of the regen- 
eration set and thus the length of the subchains. If the value selected for 
c is low, then there will be fewer and longer subchains for a given over- 
all chain length; a high value of c produces more subchains of shorter 
length. Calculation of the confidence intervals via the classic ratio es- 
timator is based on the Central Limit Theorem and therefore requires 
an adequate number of subchains. For the three data sets analyzed, the 
regeneration constant c was selected (via trial and error with test runs) 
to generate subchains with an average length of at least 10 samples, 
while the length of the overall chain was set to produce 100 subchains. 
The important of the length of the subchains on the asymptotic confi- 




p(Kpc) 
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Fig. 1 . Estiated marginal posterior distributions and posterior density of AUC for 
the two simulated subjects with low and high values of AUC. The posterior distribu- 
tions were obtained with RIMH algorithm. 
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Table 3. Posterior parameter distribution results for the three selected subjects 
obtained using the RIMH algorithm. 





Kpc (hr *) 




Kcp (hr *) 


Subject 


AUCj- 


AUCm 


A UCh 




I AUCz, | 


AUCa/ 


AUCh 


Mean 


0.190 


0.150 


0.163 






0.275 


0.561 


Mode 


0.152 


0.128 


0.146 




0.209 


0.250 




Std.Dev. 


0.0927 


0.0628 






0.0589 






95% h.p.d. 


[.03, .36) 


[.05, .27) 


■EDEM 




[.14, .36] 


MEgBl 


| [.39,. 80] 





V (L/m^/hr) 




CL (L/m'Vhr) 


Subject 


AUCt 


AUCa/ 


AUC W 




AUC L 


AUCa/ 


AUC„ 


Mean 


2.31 


2.21 


1.68 




0.929 


0.671 




Mode 


2.32 


2.22 


1.69 




0.925 


0.673 




Std.Dev. 


0.256 


0.192 


0.105 




0.126 


0.104 


0.0436 


95% h.p.d. 


[1.8,2.8| 


[1.9,2.61 


[1.5,1.81 




[0.69,1.2) 


© 

4— 

00 

o 

bo 

00 


[0.16,0.32] 



Table 4. Characteristics of the RIMH chains for the three subjects. 



Subject 


AUC t 


AUCa/ 


AUC„ 


c (% of max<(a| 2 )) 


80% 


72% 


99% 


Avg. subchain length 


15.7 


10.9 


18.7 


# Posterior samples 


1565 


1093 


1866 


# Rejected samples 


4354 


1860 


93141 


Rejection ratio 


0.736 


0.630 


0.980 


Run time 


46 s 


32 s 


7 m 



A total of 100 subchains for each subject. 



dence intervals is discussed below. Inspection of Table 4 indicates that 
the particular value of the regeneration constant c needed to meet the 
above conditions is dependent on the particular data set. 

To evaluate the confidence interval results presented in Table 5 a cov- 
erage study was performed. For each of the three subjects (Table 2), 100 
data sets were generated using the error model given above. The RIMH 
algorithm was used to obtain the 95% confidence intervals for the mean 
and selected quantilies of the four model parameters and AUC(0,24) for 
each of the 100 data sets. To evaluate the coverage of the confidence 
intervals, estimates of the true mean and quantiles of the posterior den- 
sity were obtained from the average of five long chains (20,000 samples 
each) generated using the nominal data set. The resulting percent cov- 
erage of the 100 calculated confidence intervals is given in Table 5 for 
the posterior mean and selected quantiles of the parameters V, CL and 
AUC(0,24), for each of the three subjects. In the analysis of each of the 
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100 data sets (for a given subject) the same value of the regeneration 
constant c was used. While the selection of this regeneration constant 
had only a modest effect on the percent coverage of the confidence in- 
tervals for the expected values of the model parameters, the confidence 
intervals for the qualtilies could be significantly influenced by the value 
used to define the regeneration set. In particular when the number of 
distinct values of the random variable within a subchain was small (less 
than 10) the calculated coverage of the 95% confidence intervals for the 
quantiles could diverge significantly from the nominal value. Accord- 
ingly, in the coverage study simulations, the values of the regeneration 
constants for each of the three subjects was selected to yield no fewer 
than 10 unique sample values in any of the subchains for each of the 100 
simulated subjects. 



Table 5. For the RIMH algorithm, 95% confidence interval for selected posterior 
quantities together with % coverage. 



V ( L/m 2 ) 95% Cl (% coverage) 



Quantity 


AUCr, 


AUCa/ 


AUCh 


E(V) 




2.29, 2.33] (94%) 


[2.19, 2.22] (92%) 


[1.66, 1.69] (94%) 


Q(.025) 




1.82, 1.89] (92%) 


[1.84, 1.89] (93%) 


[1.45, 1.49] (88%) 


Q( 05) 




1.88, 1.89] (96%) 


[1.89, 1.93] (96% 


[1.46, 1.53] (92%) 


Q(.5) 


1 


2.27, 2.32] (88%) 


[2.19, 2.22] (91%) 


[1.68, 1.71] (93%) 


Q(.95) 




2.71, 2.84] (94%) 


[2.49, 2.59] (94%) 


[1.82, 1.85] (88%) 


Q(.975) 


1 


2.84, 2.95] (92%) 


[2.58, 2.69] (94%) 


[1.84, 1.91] (89%) 



CL (L/ad/hr) 95% Cl (% coverage) 



Quantity 


AUCt 


AUCa/ 


AUC W 


E(CL) 




[0.920, 0.938 


| (92%) 




[0.662, 0.679] (95%) 


[0.231, 0.246] (94%) 


Q(.025) 




[0.641, 0.692 


(93%) 




[0.450, 0.483] (96%) 


[0.137, 0.164] (93%) 


Q(.05) 




0.695, 0.744 


| (89%) 




0.482, 0.511] (95%) 


[0.110, 0.182] (91%) 


Q(-5) 


1 


0.917, 0.937 


| (92%) 




0.655, 0.685] (94%) 


[0.233, 0.252] (94%) 


Q(.95) 


[1.11, 1.17] (94%) 




[0.826, 0.855] (94%) 


[0.303, 0.315] (99%) 


Q( .975) 


[1.17, 1.21] (95%) 




[0.854, 0.899] (96%) 


[0.315, 0.327] (99%) 



AUC(0,24) (/rg-hr/ml) 95% Cl (% coverage) 



Quantity 


AUCt 


AUCa/ 


AUC// 


E(AUC) 


[551, 560] (94%) 


[669, 678] (90%) 


[884, 901] (95%) 


Q(.025) 


[415, 441] (93%) 


[542, 560] (93%) 


[772, 797] (91%) 


Q(.05) 


[429, 459] (93%) 


[561, 578] (93%) 


(776, 811] (89%) 


Q(-5) 


[548, 560] (94%) 


[665, 680] (97%) 


[883, 905] (95%) 


Q(.95) 


[667, 682] (91%) 


[764, 778] (96%) 


[973, 990] (92%) 


Q(.975) 


[685, 717] (92%) 


[778, 805] (95%) 


[989, 1010] (93%) 
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Results using SIMH Algorithm and Coverage Study 

Table 6 lists selected moments and other quantities from the estimated 
posterior distributions of the three subjects obtained using the SIMH 
algorithm. The results are in agreement with the corresponding results 
obtained form the RIMH algorithm shown in Table 3 to within two 
digits. Table 7 summarizes the characteristics of the regenerative chain 
including the number and average length of the subchains, as well as 
the value of the splitting constant c used to define the regeneration 
times. For AUC(0,24) and the parameters V and CL, Table 8 lists the 
confidence intervals calculated from the regenerative analysis for the 
mean and selected quantiles of the posterior density. 



Table 6. Posterior parameter distribution results for the three selected subjects 
obtained using the SIMH algorithm. 





Kpc (hr *) 




Kcp (hr *) 


Subject 


AUCl 


AUCa/ 


AUC// 




AUCl 


AUCa/ 


AUC// 


Mean 


0.186 


0.154 


0.162 




0.231 


0.277 




Mode 


0.152 


0.128 


0.146 




0.209 


■OVW 




Std.Dev. 


0.0895 


0.0626 


0.0422 




0.0563 


0.0635 




95% h.p.d. 


[.036, .35) 


[.047, .27] 


[.089, .25) 




[.14, .35) 


[.15, .39) 


| [.36, 080] 





V (L/nri/hr) 




CL (L/nri/hr) 


Subject 


AUCl 


AUCa/ 


AUC// 




AUCl 


AUCa/ 


| AUC// 


Mean 


2.31 


2.21 


1.68 




0.930 


0.673 




Mode 


2.32 


2.22 


1.69 




0.925 


0.673 




Std.Dev. 


0.254 


0.190 


0.110 




0.121 


0.101 


0.0429 


95% h.p.d. 


[1.8, 2.8) 


[1.9, 2.6] 


[1.5, 1.9) 




[-67, 1.1] 


[.46, .86] 


[.16, .32] 



To determine the burn-in period for the IMH algorithm, autocorrela- 
tion plots were constructed and samples were retained after the correla- 
tion dropped to below 0.05 for all parameters. This resulted in burn-in 
periods of 100, 1,000 and 10,000 samples for the low, median and high 
AUC subjects. The disparity in burn-in times reflects the differences 
in the mixing of the chains for the three different data sets. The total 
number of samples in each chain (see Table 7) was set based on consid- 
eration of the number and length of subchains generating from the chain 
splitting algorithm. 

As a guide in the selection of the splitting constant c , the likelihood 
was evaluated at random samples of the parameters (5000 in the Teni- 
poside example) drawn from the prior parameter density. The samples 
were used to define the minimum (min(/(aj|z), i = 1, . . .)) and average 
(E(l(ai\z), i = 1, . . .)) values of the likelihood, and the splitting constant 
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Table 7. Characteristics of the SIMH chains for the three subjects. 



Subject 


AUCt 


AUCa/ 


AUCh 


f split" (0-1) 


0.01 


002 


0.001 


# Subchains 


105 


110 


136 


Avg. subchain length 


93.9 


41.9 


127 


Avg. # unique samples in subchain 


14.4 


10.8 


11.8 


# Burn-in samples 


100 


1,000 


10,000 


# Posterior samples 


10,000 


5,000 


200,000 


MH acceptance ratio 


0.1490 


0.237 


0.00849 


# Acceptance 


1,485 


1,185 


1,496 


Run time 


47 s 


28 s 


15 m 



“The splitting constant is c =min!(aj|z) + / split ■ £(t(ai|z)). 



was then selected as: min /(a*) < c < E(l(a l \z)) . For a given compos- 
ite IMH chain, low values of the splitting constant produce fewer and 
longer subchains while large values of c yield more subchains of shorter 
lenghth. The trade-off in selecting the value of the splitting constants 
is the same as that in selecting the regeneration constant as discussed 
above. For the results presented in Table 6, the splitting constant and 
total chain length were set to produce approximately 100 subchains with 
on average a minimum of 10 unique samples per subchain. To assess the 
confidence intervals calculated using the SIMH algorithm, a coverage 
study was performed as described above for the RIMH algorithm. Table 
8 gives the percent coverage of the 95% confidence intervals for selected 
parameters. 



DISCUSSION 

This chapter presents results of a general implementation of two re- 
generative MCMC methods as proposed by Mykland, Tierney and Yu 
[29], and applies them to an individual PK/PD estimation problem. 
These regenerative methods allow calculation of asymptotic confidence 
intervals of estimated quantities obtained from posterior distributions of 
model parameters, outpus and other functions of the parameters, when 
combined with regenerative simulation analysis methods. They also pro- 
vide useful diagnostics for assessing convergence and mixing of Markov 
chains. 

Both the RIMH and the SIMH algorithms produced similar results 
for the example presented. The SIMH method has the advantage that 
it can be applied to any chain produced by a conventional MCMC sam- 
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Table 8. For the SIMH algorithm, 95% confidence interval for selected posterior 
quantities together with % coverage. 



V (L/nd) 95% Cl (% coverage) 



Quantity 


AUCt 


AUCa/ 


AUCh 


E(V) 




2.29, 2.33] (92%) 


[2.20, 2.22] (95%) 


[1.67, 1.70] (94%) 


Q(.025) 




1.82, 1.89] (92%) 


[1.85, 1.89] (91%) 


[1.44, 1.52] (93%) 


Q(.05) 




1.87, 1.95] (94%) 


[1.91, 1.92] (93%) 


[1.47, 1.53] (90%) 


Q(-5) 




2.27, 2.31] (91%) 


[2.19, 2.22] (94%) 


[1.67, 1.70] (96%) 


Q(-95) 




2.71, 2.84] (93%) 


[2.50, 2.56] (92%) 


[1.85, 1.87] (85%) 


Q(.975) 




2.84, 2.91] (94%) 


(2.57, 2.65] (95%) 


[1.87, 1.93] (90%) 



CL (L/rrd/hr) 95% Cl (% coverage) 



Quantity 


AUCl 


AUCa/ 


AUC H 


E(CL) 


[0.921, 0.936] (88%) 


[0.668, 0.680] (96%) 


[0.233, 0.244] (93%) 


Q(.025) 


[0.665, 0.699] (93%) 


[0.453, 0.491] (94%) 


[0.137, 0.168] (93%) 


Q(.05) 


[0.702, 0.741] (93%) 


[0.491, 0.518] (95%) 


[0.161, 0.181] (95%) 


Q(5) 


[0.923, 0.951] (91%) 


[0.664, 0.687] (88%) 


[0.233, 0.246] (93%) 


Q(.95) 


[1.10, 1.14] (95%) 


[0.828, 0.855] (88%) 


[0.305, 0.316] (91%) 


Q(.975) 


[1.14, 1.18] (91%) 


[0.861, 0.888] (89%) 


[0.317, 0.327] (89%) 



AUC(0,24) (^g-hr/ml) 95% Cl (% coverage) 



Quantity 


AUCl 


AUCa/ 


AUCh 


E(AUC) 




551, 560] (90%) 


[670, 677 


(95%) 




893, 903] (96%) 


Q(.025) 




425, 444] (87%) 


[544, 559] 


(90%) 




773, 814] (89%) 


Q(.05) 




445, 459] (93%) 


[560, 575 


(87%) 




805, 829] (91%) 


Q(.5) 




543, 558] (90%) 


(667, 677 


(92%) 




888, 902] (96%) 


Q(.95) 




667, 679] (94%) 


[766, 779 


(94%) 




978, 999] (92%) 


Q(.975) 




681, 700] (93%) 


[780, 793] 


(95%) 


[990, 1020] (92%) 



pier. After the basic chain is generated, the splitting algorithm can be 
adjusted by changing the splitting constant so as to partially control 
the number and length of the subchains. However, if the original chain 
mixes slowly it may not be possible to set the splitting constant to pro- 
duce an adequate number of subchains to apply the ratio estimator for 
calculation of confidence intervals, given the total number of samples in 
the original chain. In such cases it may be necessary to rerun the basic 
sampler with a larger number of samples. 

Implementation of the RIMH sampler also requires some experimen- 
tation on the part of the user to select and appropriate regeneration 
constant. An advantage of the RIMH method is that the regeneration 
constant can be selected and tested using short runs of the RIMH al- 
gorithm until an adequate average length of the subchains is produced. 
Then the algorithm can be run until a sufficient number of subchains 
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is generated for a given level of confidence in the particular posterior 
quantitiy of interest - a procedure which lends itself to automation. We 
note however that the RIMH method (like the SIMH method) will not 
improve mixing behavior of a slowly mixing process. 
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INTRODUCTION 

Messenger RNA expression profiling with DNA arrays, which are ca- 
pable of providing simultaneous measurements of the levels of large num- 
bers of messenger RNA species, are increasingly recognized as being an 
appropriate and particularly suitable technique for obtaining significant 
insights into a variety of disease and treatment effects. Although the 
use of arrays can provide useful insights when combined with existing 
analysis, visualization and bioinformatics software, there are additional 
biomedical information science research challenges that are not ade- 
quately met with the currently available tools [1,2]. One such problem 
- multi-dimensional visualization - is the focus of this chapter. 

Visualization can also provide effective tools to summarize and inter- 
pret data sets, describe the contents, and expose features in time series 
data. One of the key obstacles of visualizing array data is the high di- 
mensionality of gene expression profiles. Our approach, VizStruct, is 
motivated by radial visualization techniques such as Radviz [3,4] and 
in this report we will present several case study results for visualizing 
array-derived data and summarize some underlying theory. 
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MODELING AND RESULTS 

The Mapping 

At the core of VizStruct is a projection that maps the n-dimensional 
vectors into 2-dimensional points while retaining correlation similarity in 
the original input space [3, 4], If the vector x[n] = (x[0], x[l], . . . , x[n— 1]) 
represents a data item in n-dimensional space, R n , its mapping to a point 
Fi(x[n]) in the complex plane C is given by: 

n(xM) = X>W<r 2 '«'“ (l) 

3 = 0 

The real and imaginary components of Fi(x[n]) are used for creating 
the 2-dimensional mapping. In the equation above, i = \f—\ and the 
complex exponential has the effect of dividing the circle of display into 
equally-spaced sectors. 

Equation 1 represents a substantive reformulation of the usual radial 
visualization mapping, which uses equivalent vector nomenclature. From 
the theoretical standpoint, we find that the complex number notation 
has significant advantages and allows easier derivations of the theoretical 
underpinnings. For example, on closer inspection the mapping Fi(x[n]) 
is seen to be equivalent to the first harmonic of the Discrete Fourier 
Transform (DFT). The relationship between the DFT and the mapping 
allows the fast Fourier transform algorithm to be used for computa- 
tion. The FFT is computationally very efficient and has a complexity of 
O(nlogn). The VizStruct mapping preserves the correlation relationship 
between vectors in the output space. For example, all vectors of the form 
(a, a, ... , a) would be mapped to the center of the unit circle. If two vec- 
tors X and Y share geometric similarity, i.e., Y = aX where a is scalar, 
they will map to a radial line. All vectors with the same pattern will 
map to the same line and those that are similar will map to a narrow 
region around the line. 

Data sets and Classification 

Six diverse publicly available data sets were analyzed using the 
VizStruct mapping. The descriptions and sources of these data are sum- 
marized in Table 1. The Significance Analysis of Microarrays (SAM) al- 
gorithm [5] was used for identifying subsets of informative genes for the 
data sets. The OC1 (Oblique Classifier 1) algorithm (6], which constructs 
straight lines of arbitrary slope to separate classes of data, was used to 
identify good oblique splits at each node. The accuracy of the classifier 
was evaluated using the hold-out and leave-one-out cross-validation tech- 
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to identify a 101 gene informative subset and the results using VizStruct 
are summarized in Fig. 1. The normal tissues were clearly separated 
from the colon cancers and the 2-class structure underlying the multi- 
dimensional gene expression profile was evident in the mapping. The 
distinct separation between the 2 groups resulted in accuracy of 100% 
(Table 2). 



x10 4 




Fig. 1. Figure 1 shows the VizStruct representation of the gene expression data 
from Notterman et al [8]. The 18 colon cancer samples are shown as open circles 
and the 18 controls are shown as filled circles. The x-axis is the real part of the first 
Fourier harmonic, the y-axis is the imaginary part of the first Fourier harmonic. 



Chen et al. examined the gene expression profiles of liver tissues 
from 82 liver cancers and 72 non-tumor tissues [9]. The SAM algorithm 
was used to identify a 106 gene informative subset and the results from 
VizStruct are summarized in Fig. 2. The liver tumors were clustered 
distinctly from the non-tumor tissues. There was overlap at the bound- 
ary of the oblique classifier and there were 2 errors, which resulted in an 
overall accuracy of 94% and a k coefficient of 0.89. 
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Table 2. Summary of results from sample classifications. 



Data Set 


Classes 


Genes in 
Classifier 


Evaluation 


Colon cancer 


2 


240 


Leave-one-out 


Liver cancer 


2 


106 


Hold-out 


Ovarian cancer 


2 


101 


Hold-out 


Melanomas 


3 


103 


Leave-one-out 


Pancreatic cancer 


3 


101 


Leave-one-out 


Soft-tissue cancer 


5 


102 


Leave-one-out 



Data Set 


Training Set 
Size 


Testing Set 
Size 


Error 

Count 


Accuracy 


K 


Colon cancer 


36 


36 


0 


100% 


1.00 


Liver cancer 


120 


36 


2 


94% 


0.89 


Ovarian cancer 


90 


35 


2 


94% 


0.89 


Melanomas 


38 


38 


4 


89% 


0.82 


Pancreatic cancer 


36 


36 


2 


94% 


0.89 


Soft-tissue cancer 


46 


46 


4 


91% 


0.88 



As a third case study, we examined the results of Schaner et al. who 
compared the gene expression profiles of ovarian cancers to breast can- 
cers [10]. The data set consisted of 125 sample x 3363 genes consisting 
of 57 ovarian and 68 breast cancer samples. The SAM algorithm was 
used to identify a 101 gene informative subset. Figure 3 summarizes 
the results from VizStruct; the ovarian and breast cancer profiles were 
separate in the mapping and there were 2 errors and the accuracy was 
94%. 

Visualization of Multi-Class Data Sets 

Bittner et al. analyzed 31 melanomas and compared their gene ex- 
pression profiles to 7 non-cancerous controls [11]. Their clustering results 
indicated 2 distinct clusters of melanomas, 19 samples comprised the ma- 
jor cluster and the remaining 12 were in the minor cluster. For purposes 
of visualization, we obtained an informative subset of 103 genes using 
the SAM algorithm and treated the 2 melanoma clusters and controls 
as different classes. The VizStruct results, which are shown in Fig. 4, 
indicated that the major melanoma cluster was well separated from the 
remaining samples but there was partial overlap between samples in the 
minor cluster and the controls. The overall accuracy for the melanoma 
data set using the oblique classifiers shown was 89% and the K coefficient 
was 0.82 (Table 2). 
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Fig. 2. Figure 2 shows the VizStruct representation of the gene expression data 
from Chen et al. [9], The liver cancer samples sire shown as open circles and the 
controls are shown as filled circles. The line across the graph is the oblique classi- 
fier used; the x-axis is the real part of the first Fourier harmonic, the y-axis is the 
imaginary part of the first Fourier harmonic. 



Figure 5 summarizes the VizStruct results from the pancreatic cancer 
gene expression profiles of Iacobuzio-Donahue et al. [17]; the 3-classes 
in the data set are pancreatic cancer cell lines, pancreatic cancers and 
normal pancreatic tissue and an informative gene subset containing 101 
genes, obtained using the SAM algorithm, was used. There were 2 errors; 
the overall accuracy was 94% and the k coefficient was 0.82 (Table 2). 

Nielsen et al. examined several diverse classes of soft- tissue tumors, 
which are derived from mesenchymal cells [18]. They found that synovial 
sarcomas, gastrointestinal stromal tumours, neural tumours, and some 
leiomyosarcomas had distinct gene-expression patterns. The malignant 
fibrous histiocytoma, liposarcoma, and the remaining leiomyosarcomas 
had unexpected similarities. Using hierarchical clustering, these authors 
identified 5 sample clusters, which were examined using VizStruct in Fig. 
6. The VizStruct approach had an accuracy of 91% and the k coefficient 
was 0.88 (Table 2). 
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Fig. 3. Figure 3 shows the VizStruct representation of the gene expression data 
from Schaner et al. (10). The ovarian cancer samples are shown as open circles and 
the breast cancer are shown as filled circles. The line across the graph is the oblique 
classifier used; the x-axis is the real part of the first Fourier harmonic, the y-axis is 
the imaginary part of the first Fourier harmonic. 




Fig. 4. Figure 4 shows the VizStruct representation of the gene expression data 
from Bittner et al. (11). The minor cluster of 12 melanoma samples is shown as filled 
circles; the major cluster of 19 melanomas is shown as squares and the 7 controls me 
shown as open circles. The lines aero® the graph is the oblique classifier used; the 
x-axis is the real part of the first Fourier harmonic, the y-axis is the imaginary part 
of the first Fourier harmonic. 



DISCUSSION 

Here we have demonstrated the usefulness of a novel visualization tool, 
VizStruct, for visualizing high-dimensional gene expression profiles in 2 
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Fig. 5. Figure 5 shows the VizStruct representation of the gene expression data 
from Iacobuzio-Donahue et al. (17). The 12 pancreatic cancer cell lines samples are 
shown as filled circles; the 19 pancreatic cancer tissues are shown as squares and the 
5 control pancreas are shown as open circles. The lines across the graph is the oblique 
classifier used; the x-axis is the real part of the first Fourier harmonic, the y-axis is 
the imaginary part of the first Fourier harmonic. 



dimensions. We demonstrated that the VizStruct mapping is effective 
for visualization-driven classification in several 2-class and multi-class 
data sets. 

Our theoretical analyses indicate a direct relationship between the 
radial visualization approach and the discrete Fourier transform (DFT). 
Although the DFT, more specifically the periodogram, is used widely 
in spectral analysis [ 12], our adaptation of the DFT uses the amplitude 
and phase of the first Fourier harmonic, and its use for multi-dimensional 
visualization has not been explored in detail. 

Parallel coordinates plots, in which the data along each dimension is 
plotted on a separate axis [13] are the simplest way to display gene ex- 
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Fig. 0. Figures 6 shows the VizStruct representation of the gene expression data 
from soft tissue cancers from Nielsen et al. [18]. The legend symbols are: nerve 
sheath tumors or schwannomas (open squares); synovial sarcomas (filled circles), gas- 
trointestinal stromal tumours (open diamonds); a subset of distinct leiomyosarcomas 
(open circles); the liposarcomas, malignant fibrous histiocytoma and the remaining 
leiomyosarcomas are combined as a single group (triangles). The lines across the 
graph is the oblique classifier used; the x-axis is the real part of the first Fourier 
harmonic, the y-axis is the imaginary part of the first Fourier harmonic. 



pression time series: For example, Holter et al. used this method to plot 
yeast cell cycle data [14]. Heat maps are also commonly employed in ar- 
ray data analysis. On closer inspection, however, it becomes evident that 
the heat plot is a special version of the parallel coordinates plot. Both 
parallel coordinates and heat plots suffer from the same problems: both 
work best only when there are few dimensions or only when combined 
with clustering; otherwise, the visualization becomes rapidly cluttered 
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and generally less useful. More recently, Hautaniemi et al. [15] have 
proposed a heat map based strategy for visualization of the U-matrix 
from self-organizing maps (SOM). Visualization of raw and processed 
intensities is fairly straightforward and most commercial array software 
packages offer it; each spot on the array is represented using pseudo 
color and can be linked to metadata, e.g., gene accession numbers, gene 
titles, etc. An example of a program that adds features to this class of 
array visualization problem is AVA [16]. The focus of VizStruct is very 
different from these commonly used methods: it focuses on multidimen- 
sional data visualization and this important aspect of array analysis has 
generally, received little attention. The results from direct comparisons 
(results not shown) indicate that VizStruct offers substantive and unique 
advantages over both parallel coordinate and multi-dimensional scaling 
methods. 
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niques. The hold-out method involves constructing the classifiers using 
a subset of training samples and assessing the accuracy using a mutually 
exclusive subset of the test samples. In the leave-one-out method, the 
classifiers were constmcted using all the data points except one, which 
was withheld and used for testing. The steps of classifier construction 
and testing were repeated for all the data points in the data set. The 
accuracy of the classifier predictions was assessed using overall error rate 
(the ratio of the number of errors to the total number of test samples) 
and the coefficient. The coefficient [7] is defined as: 



where po is the proportion of cases in which the raters agree and p c is 
the proportion of cases for which agreement is expected only by chance 
on the assumption that the marginal distributions of ratings are inde- 
pendent. The k coefficient is a more appropriate measure of accuracy of 
classification for multi-class data sets. 



Table 1 . Sources of the published data sets used. 



Data Set 


Data Size 
Genes x Samples 


Reference 


Colon cancer 


7464 x 36 


[8] 


Liver cancer 


3964 x 156 


[9] 


Ovarian cancer 


3363 x 125 


[10] 


Melanomas 


8067 x 38 


[11] 


Pancreatic cancer 


1492 x 36 


[17] 


Soft tissue cancer 


5878 x 46 


[18] 



Data Set 


URL from which data set was obtained 


Colon cancer 


http://microarray.princeton.edu/oncology/ 


Liver cancer 


http://genome-www.stanford.edu/hcc/supplement.shtml 


Ovarian cancer 


http://genome-www.stanford.edu/ovarian cancer/data.sh- 
tml 


Melanomas 


http://www.nature.com/nature/journal/v406/n6795/supp- 

info/406536a0.htinl 


Pancreatic cancer 


http://genome-www.stanford.edu/pancreaticl/data.shtml 


Soft tissue cancer 


http://genome-www.stanford.edu/sarcoma/download.shtml 



Visualization of 2-Class Data Sets 

Notterman et al. examined the gene expression profiles in 18 colorectal 
cancer and 18 normal tissue samples (8]. The SAM algorithm was used 
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INTRODUCTION 

Treating HIV infection is challenged by the rapid turnover and high 
reverse transcription errors of HIV viruses. A significant proportion 
of patients eventually fail to control viral replication and develop re- 
sistant viruses during treatment with most of the highly potent an- 
tiretroviral therapies (HAART). However, no primary or active site mu- 
tation/resistance to protease has been noted to date in Phase II and 
ITT antiretroviral (ARV)-naive adult and pediatric subjects treated with 

lopinavir/ritonavir (LPV/r, or Kaletra®) + 2 nucleoside reverse tran- 
scriptase inhibitors (NRTIs) [1-4]. In particular, in an Abbott Phase 
III trial with 653 patients rando miz ed to receiving LPV/r or nelfinavir 
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(NFV) with two NRTIs, d4T and 3TC, 123/327 subjects receiving NFV 
had HIV RNA >400 copies/mL during the week 24-108 period. Ninety- 
six of the 123 subjects with viral rebound had genotype data; 45% of 
these (43/96) developed primary mutation to protease, 82% (79/96) had 
3TC resistance, and 9% (9/96) had d4T resistance [1]. In the same study, 
74/326 subjects receiving LPV/r also had HIV RNA >400 copies/mL 
and 51 of the 74 subjects with viral rebound had genotype data. In- 
terestingly, none of these developed primary mutations to protease or 
d4T resistance, and only 37% (19/51) had 3TC resistance. It is impor- 
tant to note that adherence rates were similar between the two study 
regimens. The absence of resistance to protease inhibitors (Pis) in the 
treatment-naive patient population treated with LPV/r motivated this 
analysis. 



HYPOTHESIS 

The absence of resistance emergence to LPV/r in the treatment-naive 
patient population is hypothesized to be due to the following reasons. 

1. The median inhibitory quotients (IQ = C trough /ICso) in Pi-naive 
patients receiving LPV/r and NFV are substantially different, be- 
ing 91 (N=30) and 3.3 (N=91), respectively, based on protein- 
binding corrected IC 50 values [5,6]. However, resistance develops 
to drugs with even higher IQ values, such as efavirenz (EFV), a 
non-nucleoside reverse transcriptase inhibitor (NNRTI) [7]. Thus, 
the lack of resistance to LPV/r cannot be fully explained by high 
IQ values. 

2. Multiple steps of mutations are probably required to develop re- 
sistance to Pis with high IQ values, such as LPV/r [1,6, 8, 9]. Lit- 
erature data show that clinical isolates from patients treated with 
(NNRTIs, such as EFV, and some NRTIs, such as 3TC, often have 
large increases in /C 50 values (i.e., large resistance step sizes), 
while isolates from patients treated with Pis tend to show smaller, 
incremental increases in IC 50 values (i.e., small resistance step 
sizes) [1,7, 8, 9-12]. In the absence of data on resistance develop- 
ment in LPV/r-treated patients, results from in vitro experiments 
suggest that mutation step sizes for LPV likely are 2-3 fold [1,8,9]. 

3. Additionally, the terminal-phase elimination of LPV/r is very rapid 
when given alone or during missed doses, which theoretically can 
reduce the time during which a patient is exposed to sub-therapeutic 
concentrations and high selection pressure after missing doses. 
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When administered alone LPV is very rapidly metabolized by cy- 
tochrome P450 3A enzymes (CYP3A) [13,14]; however, LPV is 
co-formulated with a potent enzyme inhibitor ritonavir (RTV), al- 
lowing a 1 1/2 of 9-10 hours during routine BID dosing intervals 
[15,16]. The terminal t\/ 2 is less than 2 hours during missed doses, 
due to simultaneously decreasing concentrations of RTV. 



METHODS AND MATERIALS 

A differential equation model of HIV therapy integrating PK, adher- 
ence, and viral replication and resistance development dynamics was 
constructed via the Pharsight clinical trial simulator. This model al- 
lows the assessment of resistance development in any defined population 
under clinical conditions. 

PK Model 

The PK model accounts for: 

1. dose-dependent absorption of LPV: 

FDAlpv — D50Alpv / (D50Alpy + Dlpv ) 

2. saturable gut metabolism of RTV: 

FD Artc — &luv / (D^OA'jfTv + Dfirv) 

3. enzyme induction of LPV by both LPV and RTV: 

^ = -k el ■ I{t) + k el ■ (1 + a • C LPV ( t) + b ■ Cfirv (t )), / (0) = 1 

4. and inhibition of LPV clearance by RTV: 

CLlpv = CLOlpv •/(<)• K\/{Ki -I- Cr tv (<)) 

Where FDA is fraction of dose absorbed, D50A is the dose at which 
50% of the administered dose is absorbed, D is dose, 7 is a sigmoid 
factor, I is extent of induction relative to the non-induced state, CLO is 
clearance under non-induced states, k e [ is the elimination rate constant, 
a and b are constants, and Ki is the inhibition constant of RTV. 

The PK parameters of LPV and RTV were obtained by fitting Phase 
I/II PK data (N=263) using NONMEM. In trial simulations, the distri- 
butions of LPV and RTV PK parameters were based on the NONMEM 
posthoc estimates and modeled as lognormal. Residual error was also 
modeled as lognormal. The PK parameters of other anti-HIV Pis and 
NNRTIs were modeled based on data reported in the respective product 
labels. For NRTIs, since most regimens included two NRTIs, the PK 
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model treated the two NRTIs as one drug, with an elimination t\j 2 of 
15 hr (representative of lamivudine), and C trough — /Cs 0 , or IQ = 1. 

Adherence Model 

Based on dosing records collected in the clinical trials, the dose-taking 
patterns for anti-HIV drugs can be described adequately for current pur- 
poses by an independent-events (single-coin) model, and population ad- 
herence to anti-HIV drugs can be described by a beta-distribution model. 
The beta-distribution model was categorized in increments of 10%, by 
breaking the 0-100% adherence range into sub-ranges which surround 
each category. For example, the 70% adherence category represents the 
sub-range of [65%, 75%]. The distribution parameters for QD, BID, 
and TID regimens differ. These models appeared to be consistent with 
LPV/r Phase III MEMS cap adherence data. 

PD Model 

The PD model follows the change of healthy T cells (T), actively 
infected T cells (TA), long-lived infected T cells (TL), infectious virus 
(V), and non-infectious virus (W) during therapy. The PD model has 
the ability to simultaneously monitor the evolution of multiple strains 
of viruses, with each virus strain being characterized by three dimen- 
sions of mutation. Strain Vijk represents virus with mutation state i 
on protease, mutation state j on reverse transcriptase due to NRTI, 
and mutation state k on reverse transcriptase due to NNRTI. With the 
exception of wild type virus, these viral strains are pre-exiting at low 
levels in treatment-naive patients. The viral dynamics model is based 
on previously published models [17-19]. The resistance model is coded 
in Fortran and was linked to a special version of Pharsight Trial Sim- 
ulator. Below are representative differential equations for a simplified 
version with two strains of infectious viruses, wild type (Vj)and mutant 
(V 2 )) and non-infectious viruses, wild type (tzq) and mutant (w 2 ). 

= \-dT-(l-g 1 )/3 1 V 1 T-(l-g 2 )0 2 V 2 T 

= Qa( 1 ~ 9i) 0\ (1 - Mi) V\ T + q A (\ - g 2 ) 0 2 g. 2 V 2 T - ciaTai 

= QL (1 ~ 9\) A (1 - Mi) V\T + ql (1 - 52 ) P 2 H 2 V 2 T - aiTi\ 

= (1 ~ /1 ( Crtv,Clpv )) (k aTa\ + kiTu) - cV] 

— h {Crtv,Clpv) ^aTax + kiTpi) - cW\ 



dT 

dt 

dTA 1 

dt 

dT Ll 

dt 

dVi 

dt 

dWi 

dt 




215 



Analogous equations can be written for Ta?, Tl 9 , V 2 , and W 2 . In 
the above equations, A is CD4 T cell production rate, d is CD4 T cell 
death rate, g is fractional inhibition by NRTI orNNRTI drugs, (3 is viral 
infectivity, qa and qi (= 1 — qA) are probabilities that an infected CD4 
cell is active and long-lived, respectively; y.j is fraction of wild-type (j=l) 
or mutant (j=2 ) vims mutating to the other type during replication, a 
is death rate of infected cells, f is fractional inhibition by Pis, k is viral 
production rate, and c is death rate of virus. 

Viral fitness, infectivity, and mutation rate for each strain of virus can 
be specified in the model simulation. During combination therapy with 
two Pis such as LPV/r, the drug effects are assumed to be additive. 



SIMULATION-HYPOTHESIS ASSESSMENT 

The HIV PK/PD model was used to estimate the probability of resis- 
tance development for three hypothetical drugs with different IQ values, 
resistance characteristics, and drug elimination half-lives over a wide 
range of adherence levels (Table 1). The simulation was conducted for 
200 patients under treatment for 48 weeks. The simulation assumed 
that drug-resistant mutants have similar infectivity and have 70% of 
the fitness of wild-type vims. A subject is defined as having developed 
PI resistance if the subject has plasma HIV RNA >400 copies/mL at 
48 weeks and has PI resistance (for LPV-like and NFV-like drugs) or 
NNRTI-resistance (for EFV-like drugs) virus accounting for more than 
20% of the total viral load. 



Table 1 . Pharmacokinetic/pharmacodynamic characteristics of three model drugs. 





Lopinavir-like 


Nelfinavir-like 


Efavirenz-like 


Median IQ 


80 


3 


450 


Terminal t 1/2 


l-2hr/9-10hr* 


4 hr 


50 hr 


Resis. step size 


2,4,8,26X 


2,4,8,26X 


20,100,200X 



"Modeled based on Lopinavir/ritonavir Phase I, II and III PK data; LPV tj / 2 is 
about 1-2 hr during missed doses and 9-10 hr during steady-state. 



RESULTS 

As shown in Fig. 1, the probability of resistance development as 
a function of adherence rate is distinctly different for the three model 
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drugs. For the population with adherence rates >50%, the probability 
of resistance development is the lowest for LPV/r-like drugs. For the 
population with adherence rates <50%, the probability of resistance de- 
velopment is the highest for EFV-like drugs. For NFV-like drugs, the 
probability of developing resistance is highest when the adherence rates 
are higher than 50%; however, the probability of resistance development 
decreases rapidly when the adherence rates are less than 50%. Both 
LPV/r-like and NFV-like drugs have very low probability of producing 
resistance when adherence rates are <20%. It should be noted that the 
adherence rates were >80% in the Phase II and III trials of LPV/r. The 
simulation results suggest that under high adherence rates, the proba- 
bility of developing PI resistance is very low for LPV/r-like drugs, which 
is consistent with clinical observations. 



50% 



C 40% 
c 
3 
*2 

8 
a : 

oO 30% 



20% 



10% 



0% 



0% 




Fig. 1 . Effect of pharmacokinetic and resistance characteristics on viral load and 
resistance development for three model drugs. 



DISCUSSION 

Assuming adherence rates during clinical trials are higher than 50%, 
the results of the simulations indicate that NLV-like drugs would have 
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the highest probability of producing drug resistance, followed by EFV- 
like drugs, and then LPV-like drugs. This agrees with the clinical trial 
data observed to date. It should be noted that for simplicity, the simu- 
lation performed in this analysis assumed that both LPV and NFV have 
identical sequential resistance step sizes. However, in vitro experiments 
and analysis of c lini cal isolates from Pi-experienced patients indicate 
that the resistance step sizes likely are 2-3 fold for FPV and up to 5-6 
fold for NFV [1], Thus, the simulation analysis may underestimate the 
difference between FPV and NFV in genetic banders to the emergence 
of resistance. Also, it is important to note that when FPV/r is used 
to Feat Pi-experienced patients with reduced sensitivity to FPV/r (i.e., 
reduced IQ), further protease mutations have been ob saved [20]. 

It is interesting to note that the commonly perceived benefit of having 
a long elimination half-life may actually increase the probability of resis- 
tance development during inconsistent adherence, particularly when the 
resistance step size is large. As shown in Fig. 2, assuming similar fitness, 
mutants with large resistance step sizes have higher selective advantages 
over a wider concentration range as compared to those with small resis- 
tance step sizes. This effect will be exaggerated for drugs with a long 
elimination t\/ 2 , because subjects will be under high selection pressures 
for a prolonged duration during missed doses. This probably explains 
why clinical isolates with resistance to EFV in NNRTI-naive subjects 
have been reported despite large IQ values and a long elimination t 1( / 2 - 

This model was primarily based on Phase II/III clinical trial data of 
RTV and FPV/r and was subsequently validated with the results of a 
separate c lini cal trial of FPV/r [21]. Although we have also tested the 
model by simulations of reported cli ni cal trials of other Pis and NNRTIs 
(results not shown), it is apparent that this model does not address the 
totality of viral dynamics and immune response during anti-HIV therapy. 
This simplified model can be improved in many aspects. For example, a 
more quantitative model of the effect of NRTIs in HAART is desirable; 
however, quantitative data on the relationship between pharmacokinet- 
ics and pharmacodynamics for NRTIs is still lacking. Also, sensitivity 
analysis indicates that viral fitness plays an important role in viral load 
and resistance development. Therefore, it would be preferred if fitness 
data can be generated systematically using a standardized method. Sim- 
ilarly, infectivity data should also be generated systematically, although 
our sensitivity analysis suggested that viral load and resistance develop- 
ment is more sensitive to viral fitness than to viral infectivity. 
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IC50mu: IC K of mutants. 

Selection pressure was calculated as the difference in % residual replication between mutants 
and wild-type virus. 



Fig. 2. Effect of mutation step size on selective pressure. 

CONCLUSIONS 

The integrated HIV therapy model allowed a clinically relevant si- 
multaneous assessment of multiple factors critical to viral load rebound 
and resistance development in HIV therapy. The simulations of three 
distinctive model drugs suggest that the lack of PI resistance develop- 
ment in a Pi-naive population treated with LPV/r + 2 NRTIs may be 
due to the following characteristics. First, LPV in vivo IQ is adequate 
to suppress the growth of mutants. Second, LPV probably has small 
resistance steps (based on in vitro data), and hence, narrow selective 
concentration windows with low pressures toward selection of mutants. 
Finally, LPV has an accelerated elimination half-life during missed doses 
due to simultaneously declining RTV concentrations. Rapid decline of 
LPV concentrations allows short residence time at concentrations that 
are the most selective for resistant mutations. 
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INTRODUCTION 

Antiretroviral treatment of human immunodeficiency virus type 1 
(HlV-l)-infected patients with highly active antiretroviral therapies 
(HAART) consisting of reverse transcriptase inhibitor (RTI) drugs and 
protease inhibitor (PI) drugs results in several orders of magnitude re- 
duction of viral load. The decay in viral load happens in a relatively 
short-term [1,2,3], and it can be either sustained up to one year or may 
be followed by resurgence of virus within months. The resurgence of 
virus may be caused by drug resistance, non-compliance, pharmacoki- 
netics problems and other factors during treatment process. 

Many HIV dynamic models have been proposed by biomathematicians 
in the last decade to provide theoretical principles to guide the develop- 
ment of treatment strategies for HIV-infected patients [1-10] and have 
been used to quantify short-term dynamics, but these models are rarely 
applied to model virological response of long-term treatment. In other 
words, these models were only used to fit the early segment of the viral 
load trajectory data and were limited to interpret typical HIV dynamic 
data resulting from AIDS clinical trials since the viral load trajectory 
may change to different shapes in later stage due to drug resistance, non- 
compliance and other clinical factors. In this paper, we put forward a 
set of relatively simple models to characterize long-term virus dynamics 
which incorporate different factors associated with resurgence of viral 
load: the intrinsic nonlinear HIV dynamics, drug exposure and in par- 
ticular compliance to treatment, and drug sensitivity. Thus, the models 
are flexible to quantify the long-term HIV dynamics. In this paper, we 
also propose a Bayesian hierarchical nonlinear mixed-effect model ap- 
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proach to estimate the dynamic parameters in a system of differential 
equations with time-varying coefficients. 

The paper is organized as follows: Section 2 introduces a simplified 
viral dynamic model with time-varying drug efficacy which incorporates 
the effect of pharmacokinetic (PK) variation, drug resistance and ad- 
herence. The hierarchical Bayesian methods for estimating dynamic 
parameters are discussed in Section 3. Section 4 presents pharmacoki- 
netics data, drug resistance phenotype data, and adherence data as well 
as the viral load data from an AIDS clinical trial and implement the es- 
timation methods to fit the clinical data. Finally we conclude the paper 
with some discussions in Section 5. 

HIV DYNAMIC MODELS AND EFFICACY 
MODELS 

Mathematical Models for HIV Infection 

The basic dynamic model is a model that describes the population 
dynamics of HIV and its target cells in plasma. The mathematical details 
of this model have been presented elsewhere [10-14]; thus, we provide a 
brief summary here. In the model, we consider target uninfected cells 
(T), infected cells (T*) that produce virus, and free virus (V). This 
model differs from the previous models in that it includes a time-varying 
parameter 7 (f), which quantifies antiviral drug efficacy. The model is 
expressed in terms of the following set of differential equations under the 
effect of antiretroviral treatment [15]. 

~T = A — pT — [1 — ~i{t)]kTV 

jT * = [1 - 7(f)]fcTV - ST* 

j t V = NST* - cV (1) 

where A represents the rate at which new T cells are created from sources 
within the body, such as the thymus, p is the death rate per T cell, k is 
the rate at which T cells become infected by virus, 6 is the rates of death 
for infected cells, N is the number of new virions produced from each 
of infected cells during their life-time, and c is the clearance rate of free 
virions. The time-varying parameters 7 (t) is the time varying antiviral 
drug efficacy which is referred to as Efficacy Index (El), as defined later. 
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If the drug is not 100% effective (not perfect inhibition), the sys- 
tem of ordinary differential equations can not be solved analytically. 
The solutions to Eq. (1) then have to be evaluated numerically. Let 
0 = (<j>, c, S, A, p, N, k) T denote a set of parameter values, where <j) is a 
parameter in the drug efficacy model (see next section for details), and 
logi 0 (Vp (0, t)) denote the common logarithm of the numerical solution 
of V(t) for the ith individual at time tj, which is the quantity we mea- 
sured in plasma and will be used for dynamic parameter estimation. 

Similar to the analysis in [10], it can be shown from Eq. (1) that if 
7(0 > % (Cc is a threshold of drug efficacy) for all t , where 



e c = 1 - 



cp 

kNX' 



( 2 ) 



then in theory virus will be eventually eradicated. However, if 7 (t) < e c 
(treatment is not potent enough) or if the potency falls below e c before 
virus eradication (due to drug resistance, for example), viral load may 
rebound (see [10] for a detailed discussion). Thus, the efficacy threshold 
e c may reflect the immune response of a patient for controlling virus 
replications by the patient’s immune system. It is important to estimate 
e c for each patient based on clinical data. 



Antiviral Drug Efficacy 

Within the population of HIV virions in a human host there is likely 
to be genetic diversity and corresponding diversity in sensitivity to the 
various antiretroviral drugs. In some clinical studies, genotype or phe- 
notype tests are conducted to determine the sensitivity of antiretroviral 
agents before a treatment regimen is selected. Here we use the pheno- 
type marker /C 50 [16], which represents the dmg concentration neces- 
sary to inhibit viral replication by 50%, to quantify agent- specific drug 
sensitivity. Herein we refer to this quantity as the median inhibitory 
concentration. To model the within-host changes over time in JC50 due 
to the emergence of new drug resistant mutations, we use the following 
function [ 10 ], 



/C M(( ) (3) 

where Iq and 7 r are respective values of IC^(t) at baseline and time 
point t r at which the resistant mutations dominate. If 7 r = 7 0 , no new 
mutation is developed during treatment. Although more complicated 
models for median inhibitory concentration have been proposed based on 
the frequencies of resistant mutations (likely to arise from the regimen) 
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and cross-resistance patterns [17,18], in clinical studies it is common to 
measure /C50 values only at baseline and failure time [16]. Thus, this 
function may serve as a good first approximation. 

Poor adherence to a treatment regimen is one of the major causes of 
treatment failure [19,20]. Patients may miss occasional doses, may mis- 
understand prescription instructions or may miss multiple consecutive 
doses for various reasons; these deviations from prescribed doses affect 
drug exposure in predictable ways. 

In recent years, antiretroviral drugs have been developed rapidly. 
Three types of antiretroviral agents, nucleoside and non-nucleoside re- 
verse transcriptase inhibitors (RTI), as well as protease inhibitors (PI) 
have been widely used in developed countries. Other types of antivi- 
ral agents such as infusion inhibitors are also in an active development 
stage. Highly active antiretroviral therapy (HA ART), containing three 
or more antiretroviral drugs, has proven to be extremely effective at re- 
ducing the amount of virus in the blood and tissues of infected patients. 
In viral dynamic studies by Perelson and Nelson [2], Wu and Ding [3], 
Ding and Wu [21,22], investigators assumed that the drug efficacy was 
constant over treatment time. However, drug efficacy may vary as the 
concentrations of antiretroviral drugs and other factors ( e.g ., drug re- 
sistance) vary during treatment [2,21]. Also in practice, patients’ viral 
load may rebound during treatment and the rebound may be associated 
with resistance to antiretroviral therapy [23]. Here we use the following 
modified E max model [24] for the drug efficacy of two agents within a 
class (for example, protease inhibitor drugs), 

,.s _ + IQ 2 (t)A 2 (t) _ AIQijt) 4- AIQ 2 (t) 

7U ~ <t> + IQi(t)Ax(t) + IQ 2 (t)A 2 {t) ~ <f> + AIQ\(t) T AIQ 2 (t) 1 ’ 

where IQ l {t) = C\ 2h / IC^t) and AIQi(t) = IQi{t)A t (t) ( i = 1,2) 
denote the inhibitory quotient (IQ) [25,26] and adherence-adjusted in- 
hibitory quotient (AIQ), respectively; C\ 2h and ICl Q ( i = 1,2) are the 
drug concentration in plasma measured at 12 hours from doses taken 
and median inhibitory concentration for the two agents, respectively; <j> 
is a parameter to be estimated. Note that C\ 2h could be replaced by 
other PK parameters such as area under the plasma concentration-time 
curve (AUC) or trough level of drug concentration (C m in)- Theeffectof 
adherence, A{(t), is given by 

{ 1 for kT < t < (k + 1 )T 

and all doses are taken in [kT, (k + 1)T], 

Ri for kT <t<(k + 1 )T 

and 100.Rt% doses are taken in [kT, ( k + 1 )T], 



Ai{t) = 



( 5 ) 
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where 0 < Ri < 1 (i = 1,2) and T denotes the questionaire time to 
patients. Lack of adherence reduces the drug exposure, which can be 
quantified by Eq. (5), and thus reduces drug efficacy based on Eq. (4) 
and, in turn, can affect viral response. The time-varying parameter j(t) 
ranges from 0 to 1 and is referred to as the drug efficacy index (the 
inhibition rate of viral replication) in a viral dynamic (response) model. 
If 7(f) = 1, the drug is 100% effective, whereas if y(t) = 0, the drug 
produces no effect. More details regarding viral dynamic models and 
time- varying drug efficacy model can be found in Huang et al. [10]. Note 
that, if C\ 2fl , Ai(t ) and IC$ 0 (t) can be observed from clinical study and 
4> can be estimated from clinical data, then time-varying drug efficacy 
7(f) can be estimated during the course of antiviral treatment. 



STATISTICAL METHODS 

It is a great challenge to use the differential model Eq. (1) to fit clin- 
ical data and to predict antiviral response, because there are too many 
unknown parameters that need to be determined. We usually only have 
viral load data available from clinical trials (CD4 + T cell count data are 
considered too noisy to be used for dynamic parameter estimation). In 
this case, we may not be able to identify all the unknown parameters in 
the model given in Eq. (1). To deal with the unidentifiability problem, 
mathematicians simply substitute some of the unknown parameters with 
their estimates from previous studies or literature (see Perelson et al. [1] 
for example). Here we propose a Bayesian approach to tackle this diffi- 
culty. The advantages of Bayesian methods include (i) not only a point 
estimate, but also the uncertainty of the estimate from the previous 
studies or literature can be appropriately incorporated; (ii) the sparse 
data from some individuals can still be handled by borrowing informa- 
tion from the population; (iii) the Markov chain Monte Carlo (MCMC) 
methods can be easily used for computations, and closed-form solutions 
to the model Eq. (1) are not necessary; (iv) since complete posterior 
distributions for the unknown parameters can be obtained, it is easy to 
conduct inferences; (v) the unidentifiability problem can be resolved us- 
ing the population prior information. We propose a two-step Bayesian 
approach to estimate the unknown parameters for the model in Eq. (1) 
for all subjects. In the first step, we use a hierarchical Bayesian frame- 
work (Bayesian mixed-effects model) to incorporate the population prior 
for estimating prior for each individual, whereas, in the second step, the 
estimated prior for each individual from step 1 is applied to a standard 
Bayesian nonlinear regression model to obtain the final estimates of the 
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unknown parameters for each individual. This is an empirical Bayesian 
idea in spirit. A description of the methodologies is given below. 

Hierarchical Bayesian Modeling Approach 

A hierarchical (mixed-effects) Bayesian model can be used to incor- 
porate a prior at the population level into the estimates of dynamic 
parameters for individual subjects using the empirical Bayesian idea. 

Denote the number of subjects by n and the number of measurements 
on the ith subject by m l . In the model of Eq. (1), we are able to obtain 
measurements for viral load V(t) from AIDS clinical trials. For nota- 
tional convenience, let (j, — (log(^), log(c), log(<5), log(A), log(p), log (A), 
log(fc)) T , di = (i log{<t>i ), log{a),log{6i),log(\i),log(pi),log{Ni),log{ki)) T , 
0 = {0 U i = 1, . . . , n}, 0 {i} = {0 ( , l ± i} and Y = {yy , i = 1, • • ■ , n; j = 
1 , • • • , m t } . Let ) be the numerical solution of common loga- 

rithmic viral load log 10 (F(f)) to the differential equations in Eq. (1) 
for the ith subject at time tj. The repeated measurements of com- 
mon logarithmic viral load for each subject, yij(t), at treatment times 
tj, j = 1, 2, • • • rrii, can be written as 

yij(tj) = fij(Oi,tj) +ei(tj)> * = j = l, • • • m{ ( 6 ) 

where ej(i) is a measurement error with mean zero. Then the Bayesian 
nonlinear mixed-effects model can be written as following three stages 
[26]. 

Stage 1. Within-subject variation in common logarithmic 
viral load measurements: 



y i = f i{0i) + e*, [ei|cr 2 ,0i] ~ A/’(0,CT 2 I mi ) (7) 

where y 4 = (yu(ii), • ■ • ,yim i (tm i )) T , W) 

(/tl(^i) ^1 )> ' * ‘ i firm ttm )) i Gi = 

and the bracket notation \m denotes the conditional 
distribution of A given B. 

Stage 2. Between-subject variation: 

9i = /Lt + bi, (8) 



Stage 3. Hyperprior distribution: 

cr~ 2 ~ Ga(a, 6), p~A/’(» 7 , A), £ _1 ~ 



( 9 ) 
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where the mutually independent Gamma (Ga), Normal (M) and Wishart 
( Wi) prior distributions are chosen to facilitate computations [27]. The 
parameters o, 6, rj, A, O and v that characterize the hyperprior distribu- 
tions are assumed to be known from previous studies. 

The Markov chain Monte Carlo (MCMC) methods will be used to fit 
this three-stage hierarchical model (see below for details on the MCMC 
implementation). From this model-fitting, we expect to obtain the pos- 
terior distributions of unknown parameters (0 t ,i = 1, ••• n) for each 
individual. Under the normality assumption, it is enough to retain the 
posterior mean (denoted by $*) and diagonal variance matrix (denoted 
by A*). The estimated posterior distributions for individuals will serve 
as a prior for individual Bayesian nonlinear regressions in the second 
step introduced below. 

Note that the hierarchical model fitting may produce good estimates 
of population parameters, but it may not fit all individual data well, 
in particular, when the be tween- subject variation is large [28]. This is 
because the objective of fitting the hierarchical model is to minimize 
overall risk of all subjects. Thus, some subjects whose data are away 
from the population mean may not get a good fit. Since our purpose is 
to get a good fit for all individuals, we only use the hierarchical model 
fitting to obtain a reasonable prior for each individual, and then fit each 
individual’s data, which is our focus, using a standard Bayesian nonlinear 
regression method. 

Bayesian Nonlinear Regression for Individual Subjects 
and a Two-Step Bayesian Approach 

The three-stage hierarchical Bayesian model introduced in the last 
subsection is only used to combine the informative prior at population 
level and the observed data from all individuals to estimate the prior 
distribution for each individual. Then, in the next step, our focus will be 
to fit the data from each individual using a standard Bayesian nonlinear 
regression method facilitated by the estimated individual priors from 
Step 1. This idea is similar to the empirical Bayesian method in which 
the observed data are also used to estimate the prior distributions. 

The standard Bayesian nonlinear regression model can be specified as 
follows. For subject i 

Model : y* = £(0.) + e u Mer 2 , 0i] ~ M(0, <7 2 I m ,), 

Prior-. <r~ 2 ~ Ga(a,b), 0* ~ N(0*,A*), 

where a, b> 0* and A? can be obtained from fitting the three-stage hier- 
archical Bayesian model introduced in Section 3.1. We fitted the data 
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from each individual independently using the MCMC method. Thus, 
our primary objective is to fit the data from all individuals well. In 
general, the two-step Bayesian approach can be summarized as follows. 

Step 1: Fit the three-stage hierarchical Bayesian model to esti- 
mate the prior distributions of unknown parameters for each in- 
dividual using the prior information at population level and the 
observed data from all individuals. 

Step 2: Fit a standard Bayesian nonlinear regression model to 
the data for each individual using the estimated prior distributions 
from Step 1. 

Choice of Priors 

In the proposed two-step Bayesian approach, we only need to specify 
the priors at population level at Step 1. The population estimates of 
unknown parameters are easy to obtain from the previous studies or lit- 
erature and usually the population estimates are accurate and reliable. 
The population prior information also helps to resolve the unidentifia- 
bility problem at individual levels. 

In principle, if we have reliable prior information for some of the 
parameters, strong prior (smaller variance) may be used for these pa- 
rameters. For other parameters such as 0, we may not have enough 
prior information or we may intend to use the available clinical data 
to determine since they are critical to quantify between-subject varia- 
tions in response; a non-informative (‘vague’) prior (a larger variance) 
may be given for these parameters. In particular, one usually chooses 
non-informative prior distributions for parameters of interest. 

Based on the discussion above, in our data analysis we chose the values 
of the hyper-parameters (population priors) in Step 1 as follows: 

a = 4.5, 6 = 9.0, ^ = 8.0 

T) = ( 0.0, 1.1, -1.0, 4.6, -2.3, 6.9, -11.6) r 

A = dia<7( 1000.0, 0.0025, 0.0025, 0.0025, 0.0025, 0.0025, 0.001) 

n = diag( 1.25, 2.5, 2.5, 2.0, 10.0, 2.0, 2.0) 

These values of the hyper-parameters were determined based on several 
studies published in [1,2,4,5,7,11]. In the second step, we also used a 
non-informative prior for 0, which is a primary interest for us. The 
sensitivity analysis for these priors will also carried out to investigate 
the robustness of our conclusions against the selections of these priors. 
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MCMC Implementation 

In the Bayesian approach, in addition to specifying the model for 
the observed data and prior distributions for the unknown parameters, 
inference concerning the unknown parameters is then based on their pos- 
terior distributions. In above Bayesian modeling approaches, evaluation 
of the required posterior distributions in a closed-form is prohibitive; 
however, it is relatively straightforward to derive either the full condi- 
tional distributions for some parameters or explicit expressions which are 
proportional to the full conditional distributions for other parameters. 

Bayesian techniques allow us to fit complex models using powerful 
Markov chain Monte Carlo (MCMC) methods. MCMC methods are an 
established suite of methodologies that enable samples to be drawn from 
the target density of interest under the Bayesian modeling framework. 
The methods work by defining a Markov chain whose stationary density 
is equal to the target density. The chain is then simulated for a time 
deemed adequate for convergence to have occurred, and the samples 
are drawn from the simulated chain. These samples are samples from 
the target density of interest. To implement an MCMC algorithm, here 
Gibbs sampling steps were used to update a _2 ,/u and £ _1 , while we 
updated 0*, i = 1, • • • ,n, using a Metropolis-Hastings (M-H) step with 
proposal density. 

When the convergence is reached, the resulting value is a draw from 
the target density. Thus, as the number of iteration increases, the chain 
approaches its equilibrium condition. After we collect the final MCMC 
samples, we are more interested in the posterior means and credibility 
intervals. For more detailed discussion of the MCMC scheme and the 
convergence of MCMC methods, consisting of a series of Gibbs sampling 
and M-H algorithms, please refer to literature [29-33] and more detailed 
descriptions of MCMC implementation were reported in [15]. 

Following an informal check of convergence based on graphical tech- 
niques according to the suggestion of Gelfand and Smith [34], for the 
three-stage hierarchical Bayesian modeling method (Step 1), after an 
initial number of 20,000 bum-in iterations, every 5th MCMC sample was 
retained from the next 80,000 samples. Thus, we obtained 16,000 sam- 
ples of targeted posterior distributions of the unknown parameters. How- 
ever, for the standard Bayesian nonlinear regression modeling method 
(Step 2), the MCMC was implemented with 300,000 iterations and the 
first 100,000 iterations were discarded. 40,000 samples of posterior dis- 
tributions of the unknown parameters were summarized by saving every 
5th MCMC sample from the next 200,000 samples. Note that a larger 
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number of MCMC iterations were chosen in Step 2 in order to accom- 
modate a small number of data for individual subjects. 

Sensitivity Analysis 

As we have known, a common concern with Bayesian methods is their 
dependence on various aspects of the modeling process. Possible sources 
of uncertainty include the prior distributions, the number of levels in 
the hierarchical model and initial values. The most basic tool for in- 
vestigating model uncertainty is the sensitivity analysis . That is, we 
simply make reasonable modifications to the assumptions in question, 
recompute the posterior quantities of interest, and see whether they have 
changed in a way that significantly affects the result interpretations or 
conclusions. If the results are robust against the suspected assumptions, 
we can report the results with confidence and our conclusions will be 
solid. However, if the results are sensitive to the assumptions, we wish 
to communicate the sensitivity results and interprets the results with 
caution [28]. 



APPLICATIONS TO AIDS CLINICAL TRIAL 
STUDY 

Clinical Trials and Data Description 

We illustrate the proposed methodologies using an AIDS clinical study. 
This study was a Phase I/II, randomized, open-label, 24-week compara- 
tive study of the pharmacokinetic, tolerability and antiretroviral effects 
of two regimens of indinavir (IDV), ritonavir (RTV), plus two nucleoside 
analogue reverse transcriptase inhibitors (NRTIs) with HIV- 1 -infected 
subjects failing Pi-containing antiretroviral therapies. The 44 subjects 
were randomly assigned to the two treatment arm A (IDV 800 mg ql2h 
+ RTV 200 mg ql2h) and Arm B (IDV 400 mg ql2h + RTV 400 mg 
ql2h). Out of the 44 subjects (36 men and 8 women with median age 
37 years old), 42 subjects are included in the analysis; for the remaining 
two subjects, one was excluded from the analysis since the PK parame- 
ters can not be calculated and the other was excluded since PhenoSense 
HIV could not be completed on this subject due to an atypical genetic 
sequence that causes the viral genome to be cut by an enzyme used in 
the assay. 

Plasma HIV-1 RNA (viral load) measurements were taken at days 0, 7, 
14, 28, 56, 84, 112, 140 and 168 of follow-up. The nucleic acid sequence- 
based amplification assay (NASBA) was used to measure plasma HIV-1 
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RNA, with a lower limit of quantification, 50 copies ImL. The HIV-1 
RNA measures below this limit are not considered reliable, therefore we 
simply imputed such values as 25 copies ImL. 

The data for pharmacokinetic parameters (Cuh, Cmax > Cmin and 
AUC), phenotype marker (baseline and failure IC^s) and adherence 
were used in our modeling. Adherence data were determined from the 
percent-adherence of subjects by drug according to pill-count. 

Modeling Results 

Figure 1 shows the individual fitted curve with observed viral load 
data in logio scale, as well as the estimated drug efficacy ( 7 (()), observed 
ICso(t) and adherence for the six representative subjects. It can be seen 
that the models seem to provide a good fit to the observed data. It is 
worth noting by comparing the plots of fitted curves and estimated drug 
efficacy that, in general, if 7 (t) falls below the threshold e c , viral load 
rebounds, and in contrast, if y(t) is above e c , the corresponding viral 
load does not rebound, which is consistent with our theoretical analysis 
of the dynamic models [ 10 ]. 

As stated above, the hierarchical model fitting may produce good 
estimates of population parameters, but it may not fit all individual data 
well, in particular, when the between-subject variation is large. This is 
because the objective of fitting the hierarchical model is to minimize 
overall risk of all subjects. Thus, some subjects whose data are away 
from the population mean may not get a good fit. Since our purpose is 
to get a good fit for all individuals, we only use the hierarchical model 
fitting to obtain a reasonable prior for each individual, and then fit 
each individual’s data, which is our focus, using a standard Bayesian 
nonlinear regression method. As a result, we compare the performance 
of model fitting between the two step Bayesian approaches. It shows that 
the accumulated sum of squared residuals for 42 subjects are 99.087 and 
33.693 for Step 1 and Step 2, respectively. Fig. 2 compares the individual 
sum of squared residuals (SSR) at Step 1 with those at Step 2 for 42 
subjects. It can be seen that model fitting at Step 2 outperforms that 
at Step 1 based on the SSR. 

Sensitivity Analysis 

To examine the dependence of dynamic parameter estimates on the 
prior distributions and initial values, we carried out the sensitivity anal- 
ysis. We implemented the MCMC sampling scheme using FORTRAN 
code and monitor several independent MCMC runs, starting from dif- 
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SSR al Step t 

Fig. 2. A comparison of individual sum of squared residuals (SSR) between Step 
1 and Step 2 (o: Arm A; x: Arm B), 



ferent priors and/or different initial values. The results were similar and 
stable (data not shown). 

In fact, compared with the mean vector t] = (0.0, 1.1, —1.0, 4.6, —2.3, 
6.9, — 11.6) t of prior distributions used in Section 3.3, we chose two 
alternative mean vectors ij — (0.7, 1.8, -0.35, 5.3, —1.6, 7.6, — 10.9) T 
(higher level) and r] = (-0.7, 0.4, -1.7, 3.9, -3.0, 6.2, — 12.3) r (lower 
level) of prior distributions, respectively, with two sets of different initial 
values in sensitivity analysis. 

We summarize the sensitivity analysis results as follows: (i) the esti- 
mated dynamic parameters were more sensitive to the priors than to the 
initials, but both results are reasonable and robust (data not shown), (ii) 
when different priors and/or different initials were used, the results fol- 
low the same patterns as those presented in Section 4.2. The conclusions 
of our analysis remain unchanged. 



DISCUSSIONS 

This paper investigated a two-step Bayesian approach for estimating 
dynamic parameters in HIV dynamic models during a long-term treat- 
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ment period. We have developed a modified HIV dynamic model with 
time-varying antiviral drug efficacy, which is a system of nonlinear dif- 
ferential equations. We established relationships of virological response 
(viral load trajectory) with drug exposure (PK and adherence) and drug 
sensitivity (/C 50 ) via a viral dynamic model. The model fitting to the 
observed data for individual subjects was very well. 

Bayesian approach offers several advantages over other methods (such 
as nonlinear least squares method) for estimating parameters in HIV 
dynamic models. First, a Bayesian approach is a powerful way for in- 
corporating prior information from the existing viral dynamic studies. 
Not only the point estimates of viral dynamic parameters from previous 
studies can be incorporated, but also the uncertainty of these estimates 
can be incorporated in the analysis. This is very important when the 
current clinical data are not enough to identify all viral dynamic pa- 
rameters. Thus, a Bayesian method is a powerful and efficient tool to 
process all the information available. Secondly, a Bayesian approach 
provides full posterior distributions of parameters, not just point esti- 
mates, which can give a complete inference for the parameters of interest. 
Thirdly, the Gibbs sampler or related refinements of MCMC methods 
under the Bayesian framework provide flexible and powerful tools to 
estimate parameters in complex models. 

In summary, the mechanism-based dynamic model is powerful and 
efficient to establish a relationship of antiviral response with drug ex- 
posure and drug sensitivity, although some biological assumptions have 
to be made. The dynamic parameters for individual subjects can be 
identified by borrowing information from prior population estimates us- 
ing the newly developed Bayesian approach. The established models 
can be used to simulate antiviral responses of new antiviral agents and 
new treatment strategies. A full-blown AIDS clinical trial can be simu- 
lated to assist in designing a complicated trial and in studying biological 
hypothesis using the developed models and methods. 
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INTRODUCTION 

Many commonly used anticancer agents cause serious toxicity. Since 
cytotoxic agents target rapidly dividing cells, normal healthy cells that 
turn over rapidly are often hit by these agents, along with cancer cells 
that are the intended target. Of primary clinical importance are the 
neutrophil cells in the blood. These cells play an important role in the 
immune system, and abnormally low levels put the patient at increased 
risk of serious, perhaps fatal, infection. Therefore, dosing of anticancer 
agents is limited by the effect of the drugs on the immune system. Medi- 
cal oncologists have to strike the right balance between providing benefit 
and avoiding life-threatening side effects. 

Over the past 15 years, the notion of delivering very high doses of 
chemotherapy to treat some forms of cancer has gained a foothold among 
medical oncologists. With this form of therapy, the patient receives doses 
of toxic anticancer agents that are high enough to virtually wipe out the 
immune system of all white blood cells. Concomitant treatments, espe- 
cially infusion of fresh blood cells, help jumpstart the patient’s marrow 
to produce new blood cells. By transplanting either the patient’s own 
blood cells or blood cells from a matched donor, the hope is to make 
severe hematologic toxicity a reversible side effect. If all goes well, the 
ultra-high doses of chemotherapy also kill all cancer cells that may be 
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circulating throughout the body, leaving the patient thoroughly cancer 
free. Of course, too high a dose will kill more than just blood cells and 
may kill the patient. Therefore, the treating physician would like to find 
the dose that is high enough to benefit the patient but not so high that 
it causes the patient to die. 

At the University of Texas M. D. Anderson Cancer Center, there is 
a group of medical oncologists who are treating leukemia patients with 
high-dose chemotherapy, followed by bone marrow transplantation. A 
commonly used regimen to ablate the marrow is the combination of two 
drugs: cyclophosphamide and busulfan [1,2]. As with all drugs, however, 
inter-individual variation contributes to different levels of exposure to 
the same dose of the same drug. Such heterogeneity can cause severe 
toxicity for one patient, whereas the concentrations in another patient 
may be below therapeutic levels. 

One of us (B.S.A.) has considered individualizing the dose of busulfan 
delivered intravenously, targeting systemic exposure within an appropri- 
ate range. The measure of systemic exposure we consider to be relevant 
to this study is the area-under-the-concentration-time curve (AUC) of 
intravenous busulfan [3]. The idea of this study is to deliver to the pa- 
tient a low, test dose of i.v. busulfan, measure the pharmacokinetics 
(PK), and then use the patient-specific data, along with everything else 
we know, to arrive at the full-dose of busulfan that is most likely to 
result in the patient’s AUC falling in the target range. Dose individual- 
ization via feedback mechanisms has a long history in medicine. Jelliffe 
and colleagues describe many of the issues involved in using population- 
based models for dose individualization [4-7]. D'Argenio and Rodman 
described a population-based method for individualizing the dose of the 
anticancer drug Teniposide [8], Sandstrom et al. [9] discuss individual- 
izing the dose of oral busulfan. Examples outside of medical oncology 
exist, as well [10,11]. 

In this paper, we describe a fully-Bayesian approach to finding the 
optimal individualized dose in this transplant setting. The questions we 
address concern determining the optimal dose by minimizing a loss func- 
tion (or maximizing a utility function) and combining information across 
related but different studies, allowing for inter-patient and inter-study 
heterogeneity with a flexible nonparametric prior model. Wakefield [12] 
discusses using Bayesian decision theory to determine the optimal dose 
for a patient to achieve some target concentration. 

We choose to work within a Bayesian framework for a number of rea- 
sons. Bayesian models allow one to incorporate information from previ- 
ous studies, expert opinion, and the literature, all within a probability 
model, namely, the prior distribution. Additionally, Bayesian model- 
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ing is extremely well suited for prediction and estimation with honest 
ascertainment of uncertainty. 

In our example, we use data from studies with fixed high doses of the 
drugs to learn about between-patient heterogeneity in pharmacokinetics. 
We then combine these data with a second study that records concentra- 
tions at a low dose and a fixed high dose. The second study allows us to 
learn about dose dependence, in addition to providing more information 
about high-dose pharmacokinetics in the population. Finally, in a third 
step we observe low dose data for a new patient. Putting the pieces 
together within a coherent probability model, we calculate the optimal 
therapeutic high dose for that patient. 



MODEL 

We build our probability model around a hierarchical or population 
PK model [13]. Specifically, let be the observation (i.e., logarithm 
of plasma concentration) for patient i (i = 1, . . . , n s ) at time where 
s indexes the study (s = 1, 2, . . . , S) and j indexes the different measure- 
ment times (j = 1,2, ... ,n 3 i). In our application, s = 1,2 and 3 refer 
to the high-dose study, the second study with low and high doses, and 
the new patient in the third study, respectively. The probability model 
is given by the following set of equations. 



Observation 


p(y s ,i,j\0i) , s= l,..., 


j — 1 , 5 7l{ 


Subject-specific parameters 


P(0i|0), i= 1 




Population 


p{<t>) 


(1) 



At the observation level, we assume that a compartment model charac- 
terizes the drug concentrations over time for each patient. The residual 
between the observed and the expected concentration is lognormal (i.e., 
normal residual for the logarithm of the concentration or within-patient 
variation). This is a common assumption, and it seems to fit our data 
well. 

Many researchers assume that a normal or lognormal distribution 
characterizes the variation between patients in the compartmental PK 
parameters. This assumption may not always hold, however, and there 
may even be severe departures. For example, there is known genetic vari- 
ability in many key enzymes involved in drug metabolism, contributing 
to observed multi-modal distributions of parameters in a population [14, 
15]. There may also be unmeasured covariates that contribute to fatter 
tails in the empirical distribution than can be captured adequately by a 
normal or lognormal distribution. 
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Inference about mean levels is often reasonably robust with respect to 
distributional assumptions about the population distribution. For the 
deshed dose optimization, however, appropriate description of predictive 
uncertainties is critical, as the decision is driven by the probability of 
extreme values that can lead to adverse events, including death. In the 
context of the deshed optimal design, it is thus important to avoid overly 
restrictive parametric assumptions. 

We, therefore, choose to use a Bayesian nonparametric model for the 
prior distribution of subject-specific parameters in the population. We 
have found that these prior models are extremely flexible and allow one 
to capture most any form of probability distribution. In particular, we 
use a model that is a Dirichlet process mixture of normals [16-19]. 

In the next section, we discuss the statistical model. We then go on to 
describe the calculations for determining the optimal dose. We include 
an example with simulated data. 



STATISTICAL MODEL 

Assume we have data from two earlier studies in which patients re- 
ceived i.v. busulfan in two different schedules as transplant therapy. 
One phase II study included four once-a-day 3-hour infusions of busul- 
fan following fludarabine. The other phase II study treated the patients 
over 4 days with a 2-hour infusion of busulfan every 6 hours, followed by 
cyclophosphamide. These data allow us to learn something about the 
population pharmacokinetics of busulfan in these two clinical settings. 

We currently are treating patients with a low dose of busulfan (12 
mg over 30 minutes) prior to the high dose as part of one of the two 
schedules discussed in the previous paragraph. From these data, we will 
get information on the pharmacokinetics of i.v. busulfan at a low dose, 
along with pharmacokinetics at the higher dose, for each participant 
from this second study. The within-patient data will allow inference 
on the linearity of i.v. busulfan’s pharmacokinetics over a very wide 
range of doses. The paired low-dose and high-dose PK data will also 
provide information on how well we might be able to predict the patient’s 
pharmacokinetics with high-dose busulfan based on estimates using only 
the low-dose intravenous infusion of the drug. 

At the high doses, linear compartmental models appear to characterize 
the pharmacokinetics of i.v. busulfan fairly well [2, 20]. For illustration, 
our mathematical model at the observation (i.e., concentration) level 
assumes that the logarithm of the busulfan concentrations is normally 
distributed with mean given by the logarithm of a one-compartment 
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model and common residual variance over time and patients. The one- 
compartment model is parameterized in terms of the logarithm of the 
eli mi nation rate constant and the logarithm of the volume of distribu- 
tion. 



DIRICHLET PROCESS 

One could model the distribution of the subjects’ two PK parameters 
with a bivariate lognormal distribution. Selecting the desired optimal 
dose, however, can depend critically on outliers, skewness, and, more 
generally, departure from restrictive parametric forms. Therefore, we use 
a flexible Dirichlet process mixture of normals prior for the population 
model [16,17,21]. We have found a mixture prior to provide a very 
flexible way to carry out inference in the presence of uncertainty about 
the specific nature of the prior distribution. A Dirichlet process (DP) is 
a distribution on the space of distributions, in fact, allowing for inference 
into the nature of the population distribution. This formulation allows 
for skewness, multimodality, and many shapes beyond the standard bell- 
shaped curve. A particular need might arise when one wants to allow for 
heterogeneity within the sampled population because of genetic diversity 
or unknown covariates. 

One difficulty with the DP is that it generates almost surely discrete 
distributions. A commonly used fix is to form a convolution with an 
additional continuous kernel, such as the normal density. The resulting 
model takes the form of a DP mixture of normal distributions, with the 
DP providing the mixing measure on the normal location parameters 
with respect to which the mixture is defined. From a data analytic 
perspective, the model is best understood as a model-based version of 
traditional kernel density estimation illustrated in Fig. 1. 

It turns out that one can characterize almost any density with a mix- 
ture of simpler distributions. Rather than choose the number of con- 
stituent distributions in the mixture ahead of time, we allow for a ran- 
dom number of locations through the use of a Dirichlet process mixture. 
See, for example, Ghosal et al. [22] for a formal statement in the context 
of the DP mixture models. 

Technically, we write the model in terms of the DP’s two parameters: 
the base measure (Go) and the so-called total mass parameter (M). If 
$i is the vector of subject-specific parameters and is the mean of the 
distribution of this vector, we assume that 



0i ~ N(m,v) 
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P(X) 




X 



Fig. 1 . Motivation for normal mixture model through kernel smoothing. Normal 
densities are centered at discrete random locations according to some unknown dis- 
tribution. Integration over the range of random variable X yields the desired mixing 
of normals. 



fii ~ G 

G ~ DP (G 0 , M ) , with Gq — N (6, B) (2) 

In this model, G denotes a distribution from a Dirichlet process. The 
base measure (Go) is multivariate normal centered at the vector b with 
variance-covariance matrix B. A key property of the DP is the discrete 
nature of the DP random measure G; i.e., we can write G as 

OO 

G = J> fc aoi h ) (3) 

h= 1 

where 6 ( x ) indicates a point mass at location x. We can thus write Eq. 
(2) alternatively as a mixture of Normals 

OO 

p (0*i0) = WkN ^ 

h= 1 

showing that the result of the convolution is a weighted sum or normal 
densities. 




245 



Therefore, the second level of the hierarchy in Eq. (1) becomes a mixture 
of MV Normals 



00 

p(m = 5> fc woifc,v) (4) 

h=l 

The full hierarchical model for one of the studies is 



log {yij) = log 
ey ~ N , 

Oi-NfaS), 1 
Mi ~ G } 

G~ DP {Go, M) 



[/ ($i> ^ij)] + £ ij 

t ~ Ga ( a T ,0 T ) 

oo 

Oi ~ ^ 2 w h N Oifc, 5 ) 

/i=i 

00 

i.e., G = Y^ w h s (/**) 

A=1 



( 5 ) 



DEPENDENT DIRICHLET PROCESS 

In Eqs. (2) and (4), we defined a mixture model population distri- 
bution for a single study. We need to impose some structure into the 
mixture prior distribution to allow us to borrow strength across the re- 
lated (but different) PK studies yet still allow for heterogeneities across 
the different studies. Wakefield and Rahman [23] describe how one can 
combine PK studies when the population distributions have a simple 
parametric form, such as a normal or a Student t distribution. Tom- 
linson and Escobar [24] discuss one approach for combining separate 
mixture models. Kottas and Gelfand [25] discuss stochastically ordered 
nonparametric functions. Lopes, Mller, and Rosner [26] combine related 
study data in a meta-analysis based on finite mixture measures. 

We use a dependent Dirichlet process model [27,28,29] to provide the 
necessary structure. The particular model assumes a prior population 
distribution as in Eq. (5) for each individual study and relates the 
random locations in the mixture (Eq. (4)) by means of structure 
rem in iscent of an analysis of variance (ANOVA). In the ANOVA de- 
composition of means, one relates the location across levels of factors 
in terms of additive fixed effects. For example, if one has three studies 
and one wants to allow in the analysis for the average to change sys- 
tematically across the studies, then one might model the average as an 
overall mean plus study-specific deviations from that overall average. 
The overall mean and study- specific deviations from that overall mean 
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make up four parameters. With three studies, however, one can only 
estimate three parameters. Therefore, one constrains the four parame- 
ters in some way to leave only three free parameters. Let E [Yi] denote 
the mean for study i in an ANOVA model. A common parameterization 
is E [Yi\ = ao + a,, i = 1 , 2 , 3 , where is the mean of the values in 
the ith study, ao is the overall mean, and a t is the deviation of the ith 
study from the overall average. A common identifiability constraint sets 
cti = 0 , effectively making ao = E [Y*] . 

In our prior probability model, we use the ANOVA structure to induce 
dependence of the terms in the mixture in Eq. ( 4 ) across the studies. 
Each of the random locations of the mixture is characterized as a baseline 
location plus a factor-level- specific deviation from the baseline location 
for each study’s set of parameters. In our example, we want to use a 
mixture model within each of the three studies. Let G be the mixing 

OO 

distribution of discrete locations (i.e., G — (/m))- We add a 

h= 1 

study index and write n a h for study s, point mass h. For each study or 
factor level (that is, study 1 , 2 , or 3 in our case), we can have separate 
mixing distributions (Gi, G 2 and G3) as in 

Study 1 : G\ = (/xn) + (^12) + U36 (M13) + • • • 

Study 2 : G% = uj\6 (//21 ) + ^2^ (^22) + UI3S (H23) + • • • 

Study 3 : G3 = wi <5 (/i 31 ) + U2S (fj, 32 ) + U36 (/* 33 ) + . . . 

The random locations are functions of the study through the ANOVA- 
like decomposition of the locations: 

G s (<p) = ^ WfiS (AWi) - with Ms/i = Mh + ois,h t a = 1 , 2 , 3 . 

Here, the a s< h are similar to offsets in an analysis of variance decomposi- 
tion of locations. As in the ANOVA model, we impose an identifiability 
constraint a\j t = 0 across all h. The random weights (i.e., U\, W2, W3, 

. . . ) are the same across the levels of the factor, however. Only the 
random locations change by factor level; these changes are themselves 
random. Now, the full hierarchical model across the studies is the fol- 
lowing. 



log (ysij) = 
Eij ~ N (0,T -1 ) , 

dsi ~ N {flgii S) , 1 
/At ~ G J 

G 



log 



[/ (@sii tsij)} T &ij 
t ~ Ga(a T ,/ 3 T ) 

OO 

^ai ~ ^ ^ WfrN (fA s h> <S) 
h=l 

DP (Gq, M) 



r^j 



( 6 ) 
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One can make clear the mixture of normals by writing 

0 a i ~ J N (usi, S ) dG s i {usi)) s — 1 ) 2, 3 

with each G a i (•) a Dirichlet process. 

We carry out posterior inference using Markov chain Monte Carlo. 
Since the base measure is normal, it turns out to be relatively easy to gen- 
erate random samples from the correct posterior distribution. MacEach- 
ern and Muller [30] consider posterior simulation for a general Dirichlet 
mixture model, while de Iorio et al. [28] and Muller et al. [29] discuss 
the extension to the Dependent Dirichlet Process model. 



BAYESIAN OPTIMAL DESIGN 

We wish to find the dose that has the best chance of producing sys- 
temic exposure in the target range for the patient, taking account of 
the uncertainty associated with the prediction. Bayesian decision the- 
ory provides a structure within which one can determine the optimal 
action out of a set of possible actions, accounting for uncertainties in 
outcomes and incorporating many different aspects of losses and gains 
or utilities [31, 32]. One integrates the loss function with respect to the 
predictive distribution for each possible decision. When one has carried 
out these calculations, one can then pick the decision associated with 
the maximum utility (or minimum loss). 

Since the loss is a function of the AUC, we can think of the loss as 
a function of the dose, the achieved concentrations, and the patient’s 
PK parameters. All three of these factors determine the concentrations 
we would expect to observe for each patient. Letting u () be the utility 
function, y the observations (i.e., log concentrations), d the dose, and 
$i the patient’s PK parameters, we can write the negative of the loss 
function as the utility we wish to maximize (i.e., minimize the loss) 

u(y,d,0) = L(AUC), AUC = f(d,V,k ) 



The goal is to pick the dose d* that maximizes the expected utility (or 
minimizes the risk or expected loss) 



d* — arg max E [u (y, d, 0)] = J J u(y,d,d)p d (y,6) dy d6 
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SIMULATED EXAMPLE 



We demonstrate the application of the method with simulated data 
since the busulfan studies are ongoing. In the simulations, we assume 
that we have 3 studies. For the first study we assume that 30 patients 
each received a 60-minute infusion of 1.8 g of a hypothetical drug. In 
the second study 30 patients received both a lower dose of 360 mg (one- 
fifth of the high dose) as a test dose and the 1.8 g high dose of the 
hypothetical drug. This second study provides information about the 
relationship between the low-dose PK and the high-dose PK for each 
patient, as well as information about the population distribution of the 
PK parameters. In both cases the infusion duration was 60 minutes. 
Finally, we simulated a patient in the third study who received only the 
360 mg test dose. In all cases the sampling times were 30, 60, 90, 120, 
and 180 minutes from the start of the infusion. We use the estimated 
pharmacokinetic parameters for this patient from the test dose, along 
with the historical information, to determine the dose that is optimal in 
the sense of minimizing the expected loss based on the projected AUC. 

Each patient’s simulated concentrations came from a one-compartment 
model with noise added on the logarithmic scale. First, a pair of PK 
parameters ( Vd — volume of distribution and k e = elimination rate) was 
generated from a bivariate lognormal distribution for each patient. In 
each simulated study, half of the PK parameters came from one of two 
bivariate lognormal distributions. Table 1 shows the distributions we 
used to generate each simulated patient’s PK parameters for subsequent 
generation of observed concentrations. 

The posterior distributions of the logarithms of the PK parameters 
from each of the two simulated phase II studies are shown in Fig. 2 
for illustration. The concentrations generated for the hypothetical first 
patient in study 3 (treated with the low dose) is shown in Fig. 3, along 
with the posterior distribution for this patient’s PK parameters. This 
posterior distribution borrows strength from the two earlier studies by 
means of the DDP ANOVA probability model. 

In our example, the loss function penalizes us if the AUC falls outside 
of the target range. Furthermore, we consider the consequences of too 
high an AUC to be worse than if one’s systemic exposure is below the 
target, leading to an asymmetric loss function (see Fig. 4). 



L ( auc ) = / 



L (auc, AUC a) 
0 

L + (auc, AUC U {) 



if auc < AU Cll 
if AU Cn < auc < AU C u i 
if auc > AU C u i 



( 7 ) 



With this loss function, there is zero loss whenever the achieved AUC 
lies between a lower limit (AUCu) and an upper limit (AUC u i). If the 
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Table 1. Distributions of pharmacokinetic parameters used in the simulations by 
study. 



Study 1°: 



( log (Vj) 
V log (fee) 




Study 2 b : 



( log (Vd) 

\ log (fee) 




Study 3 e : 



( log (Vd) 
V log (fc«) 
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Fig. 2. Posterior distributions for PK parameters for the two earlier studies. 



AUC is outside of this range, the loss increases linearly as the achieved 
AUC deviates from these two limits. The loss increases more quickly for 
AUCs that are too high than for AUCs too low, reflecting the greater 
risk of immediate death. 

Integrating the loss function (see Fig. 4 and Eq. (7)) with respect to 
the predictive distribution for this patient’s AUC at various high doses 
leads to the expected loss associated with each dose. That is, for each 
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Fig. 3. Simulated low-dose concentrations for a new patient in the third study and 
the resulting posterior distribution for the patient’s PK parameters. 



dose d, we compute 



E [ u (; y , d, 0)] = J L [ AUC (9, d)}p(9\D) dO 

now explicitly writing in the dose d to show how the calculation is for a 
fixed dose. Figure 3 shows the resulting curve of expected losses by dose 
(grams). From the curve, we pick the optimal dose for the patient to be 
around 1.9 g. 
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AUC 



Fig. 4. Asymmetric toss function for targeting AUC in the busulfan example. 

SUMMARY 

We described the use of Bayesian decision theory in the design of a 
dose-individualization study. The particular application was treatment 
of leukemia with high-dose chemotherapy and stem-qell transplantation. 
Data were assumed already available from earlier phase II studies in 
which patients received a fixed high dose of chemotherapy. In one of the 
earlier studies, patients received a low, test dose of the drug, in addition 
to the same high-dose chemotherapy as in the other phase II study. 
We wanted to combine the historical information with inference about a 
patient’s own pharmacokinetics to help make more precise predictions of 
the patient’s AUC at various high doses, compared with using only the 
patient’s low-dose PK information. With the use of a simple, asymmetric 
loss function, we computed the expected loss for each possible dose and 
determined the dose that mini mi zed the loss. 
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INTRODUCTION 

Cancer is the second leading cause of death in the United States, and 
it accounts for approximately ten cents of every U.S. dollar spent on 
health care [1]. In 2003, over one million new cases are expected, not 
including carcinoma in situ (except urinary bladder) and basal and squa- 
mous cell skin cancers [1]. The apparent increased growth rate of can- 
cerous cells is attributed to a subtle imbalance in the rates of apoptosis 
(programmed cell death) and cell proliferation. This imbalance toward 
proliferation leads to increases in tumor size and eventually metasta- 
sis. Death of the host organism is often due directly or indirectly ( e.g . 
through complications) to tumor tissue interfering with healthy tissue 
function, potentially in an area inaccessible to anticancer agents. The 
observed primary tumor volume, in general, increases until the host is 
killed. From an engineering perspective, the dynamic profile of tumor 
size versus time has a shape similar to the temperature versus time pro- 
file of an unstable chemical reactor. Here the temperature of the reactor 
contents increase (due to an exothermic reaction) until the reactor rup- 
tures due to the extreme temperatures - the engineering equivalent of 
host death. 
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To maintain the viability of the reactor, control systems are incor- 
porated that alter the conditions of the reactor to maintain stability 
and simultaneously allow the reactor to generate the desired product 
concentrations. Likewise, one can consider the use of control theory to 
aid in cancer treatment design. Chemotherapy, the use of systemically- 
delivered chemical compounds to reduce tumor burden, is employed 
alone, or in combination with other treatment modalities (e.g. radi- 
ation, surgery, etc.) to treat malignant tumors [2]. The traditional 
method of schedule development is to employ pre-clinical animal studies 
to identify a candidate starting dose and candidate toxicities that may 
occur in Phase I trials. Phase I trials then establish the maximum tol- 
erated dose of the chemotherapeutic compound, the route by which the 
compound is to be administered, the schedule on which the compound 
should be delivered, and the treatment-related toxicities resulting from 
the established dose/route/schedule. Phase II studies are used to estab- 
lish efficacy of the compound, and assuming the compound is effective 
in treating a particular cancer, Phase III studies compare the new com- 
pound to the existing “gold standard” for treating the cancer of interest. 
The final schedule/route/dose is then employed in treatment, with stan- 
dard modifications employed to alter treatment for patients who develop 
unacceptable toxicity, or possibly who do not respond. 

A potentially more efficient method for dosage adjustment would be 
to have a “real-time” chemotherapy treatment algorithm recommend 
dosage alterations at routine intervals during treatment. In this case, 
the first cycle of therapy may be used to establish patient-specific char- 
acteristics, such as pharmacokinetics and toxicity. After the second cy- 
cle, response would be evaluated. Given this information, models of 
tumor growth, drug kinetics, and drug effect could be developed and 
employed in a model-based dose adjustment algorithm. This would al- 
low the alteration of dosage (and potentially schedule) after each cycle of 
therapy in response to information available from periodic clinical visits 
during treatment. Model-based control theory can provide a rigorous 
framework within which to develop a treatment scheduling algorithm 
for chemotherapy-treated cancer patients. 

Moldeling for Cancer Treatment 

Controllers for processes are constructed using either implicit or ex- 
plicit mathematical models [31. The present approach uses pharmacoki- 
netic (PK)/pharmacodynamic (PD) guided treatment. Using models of 
the pharmacokinetics and pharmacodynamic effect of the drug on the 
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Fig. 1. Schematic diagram of model-based anticancer treatment design. Pharma- 
cokinetic/pharmacodynamic patient models are explicitly used in the control algo- 
rithm to guide drug dose and delivery schedule. Measurements of pharmacokinetics 
and PD effect, including toxicity, are used to modify treatment as necessary. 



tumor, one can develop an automated treatment algprithm as shown in 
Fig. 1. 

Using the model of the patient, the control algorithm predicts the 
effect of a given dose or series of doses on the patient and tumor. By 
specifying a desired tumor trajectory, mathematical optimization can 
be used to develop a dose and schedule that drives the tumor along the 
desired trajectory subject to delivery constraints imposed by toxicity. 

Two modeling issues are significant in the context of model-based 
dosage design. Model quality is a limiting factor of achievable perfor- 
mance for model-based control algorithms [4]. This implies that more 
complicated mathematical models, such as nonlinear or highly param- 
eterized models, are required to employ higher-performing dosing algo- 
rithms. However, the ability to optimize patient treatment successfully 
is dependent on the structure of the model; a highly nonlinear model 
may result in an optimization problem with multiple local solutions and 
therefore no guarantee of reaching the treatment schedule that optimizes 
patient well-being and efficacy. 

Hence, the focus of the modeling component of this work is on the con- 
cept of control-relevant modeling [5,6]. A control-relevant model is one 
that is able to capture the time response of drug kinetics or pharmacody- 
namics while using a minimum number of parameters and a model struc- 
ture amenable to use in optimization. While physiologically-structured 
models are intuitively satisfying, models of this form often contain a 
large number of parameters that need to be identified from data. By 
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Fig. 2. Chemical structure of 9NC in active lactone and inactive hydroxy-acid 
forms. The two forms are in equilibrium after systemic delivery. 



comparison to classical engineering systems (that may be sampled at 
a rate of once per minute or more), the amount of patient data avail- 
able for parameter identification is markedly reduced. As a result, the 
present study relies upon an empirical model structure to relate drug 
dose to the PK and PD response profiles. 

Case Study 

In evaluating the above premise that control theory can be employed 
in the development of chemotherapy schedules, the following case study 
is employed. HT29 human colon carcinoma xenografts are implanted 
into severe combined immune deficient (SCID) mice. The control group 
is handled, but untreated, and the treated group receives 9-nitrocampto- 
thecin (9NC). This is a member of the class of topoisomerase I inhibitors, 
and co-exists in lactone and hydroxy- acid forms in the plasma (Fig. 2). 

Here the lactone form has antitumor activity, while the hydroxy- 
acid form is inactive. In addition to the parent compound, its pri- 
mary metabolite (9-aminocamptothecin, or 9AC), in which the 9NC 
nitro group (NO 2 ) is reduced to an amino group (NH 2 ), is also active 
in the lactone, but not hydroxy-acid, form. At present, 9NC is in Phase 
III trials for pancreatic and colorectal cancer [7], The remaining sec- 
tions address the modeling and dosage design for 9NC preclinical trials 
in SCID mice. 
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METHODS 
Experimental Studies 

Reagents 

All chemicals for HPLC analysis were HPLC grade and purchased 
from Fisher Scientific (Pittsburgh, PA). N,N-Dimethylacetamide (DMA) 
was purchased from Aldrich (Milwaukee, WI); polyethylene glycol 400, 
liquid (PEG 400) was purchased from Baker (Phillipsburg, NJ). Camp- 
tothecin, N,N-dimethylformamide and carbonyl iron (pentacarbonyl iron) 
were purchased from Sigma- Aldrich, (St. Louis, MO). 9NC and 9AC 
were provided by SuperGen (Dublin, CA). 5-Fluorouracil was purchased 
as the clinical formulation, Adrucil (Pharmacia & Upjohn, Kalamazoo, 
MI) 

Drug Formulations 

9NC was prepared at concentrations of 0.1,0.067 or 0.044 mg/ml in 
vehicle (2% DMA in 0.1 M phosphoric acid:PEG 400(49:51 v/v)). 9NC 
doses and vehicle were administered p.o. at a volume of 0.01 ml/g body 
weight using a 20-gauge oral gavage needle and 1 ml syringe. For PK 
studies, the 9NC dose was 0.67 mg/kg/day while doses of 1 mg/kg, 
0.67 mg/kg and 0.44 mg/kg were administered in efficacy studies. The 
positive control for the efficacy study was 5-Fluorouracil, diluted from 
the clinical formulation with saline such that the dose administered was 
20 mg/kg IP (0.01 ml/g body weight). 

Mice 

Female C.B.-17 SCID mice (4-6 weeks of age, specific-pathogen-free) 
were obtained from the National Cancer Institute (NCI) Animal Pro- 
duction Program (Frederick, MD) and were allowed to acclimate to the 
University of Pittsburgh Central Animal Facility for 1 week prior to ini- 
tiation of study. Mice were housed in autoclaved microisolator caging 
and were given Prolab ISOPRO RMH 3000 Irradiated Lab Diet (PMI 
Nutrition International, Brentwood, MO) and autoclaved water ad li- 
bitum. Animal rooms were maintained on a 12-hour light/dark cycle 
with at least 12 air changes/hour, and temperature was maintained at 
72 ± 2 °F. All animals were handled in accordance with the Guide to 
the Care and Use of Laboratory Animals [8] and on a protocol approved 
by the Institutional Animal Care and Use Committee of the University 
of Pittsburgh. Tri-monthly analysis of sentinel mice housed in 1/5 dirty 
bedding confirmed that the study mice remained MAP (murine antibody 
profile) test-negative throughout the study. 
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Tumor Line, Implantation, Measurements, and Calculations 

HT29 human colon xenografts were obtained from the NCI Tumor 
Repository (Frederick, MD) and were MAP test-negative. HT29 tu- 
mors were passaged in C.B.-17 SCID female mice as « 25 mg frag- 
ments implanted subcutaneously on the right flank by aseptic techniques. 
Fragments of tumor(~ 25 mg) harvested from passage mice were sub- 
sequently implanted subcutaneously into the study mice. Mice were 
observed twice daily. Tumors were measured using a digital caliper, and 
both body weights and tumor volumes were recorded twice weekly. Tu- 
mor volumes, V, were calculated from the formula: V = where l is 
the longest tumor diameter, and w is the shortest diameter perpendicu- 
lar to the direction of l . 

Efficacy Studies 

Mice bearing HT29 tumors were stratified to treatment groups of 10 
mice each on day 19 post implantation (three days prior to treatment) 
such that mean and median body weight and tumor volumes for the 
groups of mice were not statistically different. Tumor volumes on day 
19 were between 26 and 71 mm 3 . Treatment groups were as follows: con- 
trol, vehicle-treated control, positive control (5-fluorouracil, 20 mg/kg), 
9NC 1.0 mg/kg/day, 9NC 0.67 mg/kg/day and 9NC 0.44 mg/kg/day. 
Treatment began on day 21, and this was defined as study day zero. 
Mice on the efficacy study received daily doses p.o. for 5 days/week 
for two weeks followed by a two week period of no treatment before 
the treatment regime was repeated. The positive control group received 
doses i.p. on the same schedule. At the completion of the efficacy study, 
mice were euthanized, and complete necropsies were performed. Only 
data from the control groups and 9NC, 0.67 mg/kg, group were used for 
this PK/PD method development. 

Pharmacokinetic Studies 

Pharmacokinetic studies were performed in non- tumor-bearing (NTB) 
female C.B-17 SCID mice and in female C.B-17 SCID mice bearing HT29 
tumors at 27 days post tumor implantation. Mice were stratified into 
groups of three such that the mean and median body weight and tumor 
volumes were similar across groups. Animals were fasted overnight prior 
to dosing, and 0.67 mg/kg 9NC was administered to the mice as a sin- 
gle oral dose based on fasted body weight. After 9NC administration, 
groups of three mice were euthanized by CO 2 inhalation, and blood was 
collected by cardiac puncture, using heparinized syringes and needles, 
at each of the following times: 5, 15, 30, 60, 90, 120, 240, 360, 420, 
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960, 1440 and 2880 min. Three additional mice were euthanized at five 
minutes after dosing with vehicle. Blood was transferred to microfuge 
tubes and stored on ice for less than three minutes before plasma was 
separated from the blood cells by centrifugation at 13,000 x g for 4 
minutes. 

Plasma Sample Preparation for HPLC Analysis 

Mouse plasma was immediately processed by solid phase extraction 
(SPE) to separate the lactone and hydroxy-acid forms of 9NC and 9AC. 
Prior to use, SPE cartridges (Waters OASIS^^ HLB 1 ml, 30 mg, Wa- 
ters Associates, Milford, MA) were conditioned with 1 ml of methanol 
and equilibrated with 1 ml of distilled deionized water. Plasma samples 
(200 /il, each) were mixed with 2.5 /il of internal standard (2 /ig/ml 
camptothecin in acetone) prior to loading on the HLB cartridges. After 
application of plasma, cartridges were washed with 1 ml of 5% methanol 
in water, and 9NC and 9AC lactone forms were eluted from the cartridge 
with 0.5 ml of methanol. The eluents were collected in microcentrifuge 
tubes and stored at -70 °C until analysis. Total 9NC and 9AC (the sum 
of lactone and hydroxy-acid forms) were determined by the methanolic 
deproteination of plasma. Plasma samples from the three mice at each 
time point were combined. Plasma (150 /il) was spiked with internal 
standard and 300 /il of cold (-20 °C) methanol, and vortexed for 1 min. 
The methanolic solutions were then centrifuged at 10,000 x g for 4 min- 
utes at 5 °C. The resulting supernatants were decanted into a cryovial 
and stored at -70 °C until HPLC analysis. 

HPLC Analysis 

The HPLC assay used to measure 9NC and 9 AC was modified from 
the 9AC HPLC assays by Kirstein et al. [9] and Supko and Malspeis 
[10]. Methanolic preparations of plasma for total 9NC and 9AC and 
solid phase eluates for the lactone forms of 9NC and 9AC were each 
split into 2 samples. 9NC in one portion of each of the methanolic 
preparations and the eluates from the SPE cartridges was converted to 
9AC by iron reduction. Lor the reduction, 12 p\ of the reducing reagent 
(25 mg reduced pentacarbonyl iron/ml H 2 O) and 6 /il of 12NHClwere 
added to 120 /il of each eluate or methanolic plasma preparation. These 
samples were vortexed for 1 min and sonicated for 30 min at 70 °C. The 
samples were then centrifuged (10,000 x g for 4 min). Reduction of 9NC 
to 9AC by reduced pentacarbonyl iron is 54%. Prior to placing these 
supernatants in siliconized (3% surfasil in toluene) HPLC autosampler 
vials, the reduced methanolic supernatants (150 /il) were mixed with 75 
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/xl of 0.5 M ammonium acetate in water, pH 5.5, while the supernatants 
from the reduced eluates (150 jd) were diluted with 100 fi\ of 0.5 M 
ammonium acetate in water, pH 5.5. 

The remaining portions of both the methanolic plasma preparations 
and the cartridge eluates of plasma that were not reduced were processed 
further to determine the concentrations of 9AC total and 9AC lactone 
formed as metabolites in the animals. Prior to placing these samples 
into the siliconized HPLC autosampler vials for determination of either 
9AC total (the conversion of the 9AC hydroxy-acid back to the lactone) 
or 9AC lactone, the pH of the samples was adjusted to be compatible 
with the mobile phase. To each of the unreduced methanolic plasma 
preparations, 15 /il of 20% phosphoric acid and 10 jA of 0.5 M ammonium 
acetate in water, pH 5.5, were added. To each of the unreduced cartridge 
eluates (150/xl), 10 /d of the 0.5 M ammonium acetate (pH 5.5) were 
added. 

The HPLC system consisted of a Waters autosampler set at 4 °C, a 
Waters 2695 Alliance System Separation Module, and a Waters fluores- 
cence 474 detector with excitation and emission wavelengths set at 365 
nm and 440 nm, respectively. Data processing was performed with a Wa- 
ters Millenium 32 system. The mobile phase was isocratic (9% methanol, 
23% acetonitrile and 68% 0.1M ammonium acetate (pH 5.5)), and the 
flow rate was 1.0 ml/min. Separation was achieved using a guard col- 
umn (Brownlee RP18, 7/nn, 15 x 3.2 mm) and a Beckman Ultrasphere 
C18, 5 nm ODS 250 X 4.6mm column (Allendale, NJ). The injection 
volume was 100 /d. All samples and standards were subjected to post 
column acidification prior to detection. HPLC column eluate was mixed 
with 0.3M trifluoroacetic acid at a flow rate of 0.3 ml/min prior to flu- 
orescence detection. Retention times for 9AC and camptothecin were 
respectively 7 and 15 min. 

Concentrations of 9AC lactone and 9AC total (hydroxy-acid and lac- 
tone, all converted to lactone form) were determined by comparison of 
the area ratios of the peak of interest to internal standard and com- 
pared to the standard curves of 9 AC. 9AC standards were prepared by 
dilution of a stock solution of 0.01 mg/ml 9AC in 10 mM acetic acid 
in dimethylformamide and added to the biological matrix to concentra- 
tions of 0.3, 1, 3, 10, 30, and 100 ng/ml. The standard curves for 9AC 
were linear from 1 to 100 ng/ml and had a %CV of less than 15% at 
midlow (5 ng/ml) and midhigh (60 ng/ml) concentrations. The lower 
limit of quantitation of the assay was 1 ng/ml. The concentrations of 
9NC lactone in the samples were determined by the difference between 
the concentrations in reduced samples, containing both 9AC and the 
9NC lactone converted to 9AC lactone, and the 9AC lactone concern 
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Fig. 3. Block diagram of five-compartment model of 9NC and 9AC pharmacokinet- 
ics. States (xi) represent drug masses, and kij are rates of transport or metabolism. 



tration in samples that did not undergo iron reduction. This difference 
was corrected by the conversion factor of 0.54 for 9NC to 9AC by iron 
reduction. Similarly, the 9NC total was determined by the difference 
of concentration in the reduced samples containing both total 9AC and 
9NC converted to 9AC and concentration of the total 9AC in the sam- 
ples that were not converted by iron reduction. Again the 9NC total 
concentrations of 9NC were corrected for the reduction of 0.54 of the 
9NC by the carbonyl iron. 

Mathematical Modeling 

Pharmacokinetics 

Compartmental modeling is a standard technique for constructing 
mathematical representations of drug PK. Using ADAPT II [11], a five- 
compartment linear model was constructed for 9NC/9AC PK in response 
to an oral dose of the drug. The block structure is shown in Fig. 3. 
Mathematically, this system can be represented as: 



x 0 = D(t) - koix 0 

±x = koiXo + fc 2 \X2 - (fcl2 + fcl3 + he)x\ 
X2 = h 2 Xi - k2\X2 
x 3 = ki 3 Xi + k 3i X 4 - (&34 + k 3e )x 3 
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X 4 — ^ 34^3 ^ 43*^4 

[9 JVC] = — 

Vl 

[QAC] = — (1) 

1>3 

Here the state xo represents the dynamics associated with gut absorp- 
tion of D(t), the orally-administered 9NC dose, and the plasma rate of 
appearance is governed by the rate fcot- States x\ and £2 are the 9NC 
lactone masses in the central and peripheral compartments, respectively, 
and £3 and £4 are similarly defined for 9AC. Metabolism of 9NC to 9AC 
occurs via £13, and metabolic elimination of 9NC and 9AC is at rates 
k\ e and k$ e , respectively. Concentrations of 9NC and 9AC are calculated 
by dividing drug mass in the plasma compartment by the distribution 
volume (v\ and 1*3, respectively). This model has 10 parameters that 
must be identified from data. 

An alternative model structure evaluated in the present work is shown 
in Fig. 4. The Laplace domain representation of Fig. 4 is equivalently 
represented by the following differential equations: 



x\ 



£2 



X 3 



£4 

[91VC] 
[9 AC] 



ko 1 

— D(t) £1 

T01 T01 

- 0 o )H(t - 0 O ) - —x* 

TQ 2 702 

— [£! + £ 2 - £3] 

n 

h 

^4 

T21 

k c l£3 

kc 2 [£4 + (1 - A: 2 )£3] 



( 2 ) 



The function Tt{t — 9o) is the Heaviside (unit step) function, which takes 
on the value of 0 when its argument ( t — &o) is less than 0, and 1 oth- 
erwise. The two-time constant model contains a “faster path” and a 
“slower path” for both 9NC and 9AC. The rational basis for this for- 
mulation stems from the two-time constant representations for distilla- 
tion columns in chemical engineering where the rapid (flow) dynamics 
are governed by the faster time constant, and the slower (composition) 
behavior is captured by the slower time constant [5,12]. Parameter es- 
timates for the two-time constant model were calculated by minimizing 
sum-of-squares prediction error between the experimental data and the 
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Fig. 4. Block diagram of the two-time constant model of 9NC/9AC PK. This 
model is presented in the Laplace domain. 



model prediction using th efinincon function of MATLAB (©2003 The 
Mathworks, Natick, MA). This model has 9 unique parameters. 

Pharmacodynamics 

Unperturbed tumor growth has been modeled according to a num- 
ber of equations [13,14,15]. The Gompertz equation is a common model 
structure that displays a slowing of tumor growth as the tumor reaches 
a “plateau population”, often greater than 10 10 cells. Given the size of 
the tumors implanted into the SCID mice in this study, combined with 
the length of time the tumors are studied (< 8 weeks), the saturation 
behavior characteristic of Gompertzian growth is not observed. Hence, 
an exponential model can be employed. There is a one-to-one relation- 
ship between tumor volume and the number of tumor cells if an average 
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cellular volume is used as the conversion factor. This allows unperturbed 
tumor growth to be represented by the following equation: 



N = 




( 3 ) 



Here A is the size of the tumor, given as the number of tumor cells, and 
r is the doubling time of the tumor, in days. 

Chemotherapeutic compounds interfere with tumor cells, thereby re- 
ducing the rate of tumor growth. Under the assumption of log-linear 
cell kill, common in the anticancer PD literature [15,16], the PD effect 
of the active compounds, 9NC and 9AC, can be incorporated in Eq. (3) 
as follows: 



A = ^A - k NC x\N - k AC x\N 

T 

x* = x\{t - e d )H(t - e d ) 

x*i = x 3 (t - 9 d )H{t - e d ) (4) 

In this model, a link is assumed between the plasma concentration of 
the anticancer drugs and their effect. Specifically, the effects of 9NC and 
9AC occur some time after drug administration. This time shift param- 
eter is given by 9d in the PD model, and it is the number of days after 
a dose of 9NC that the effect of that dose is observed. Hence, the “ef- 
fective concentrations” x\ and 23 are simply the plasma concentrations, 
x\ and 23, respectively, delayed in time by days. The cell kill terms 
are then taken to be proportional to the time-shifted drug concentration 
and the tumor size through a constant of proportionality for 9NC (k^c) 
and 9AC ( k A c )> respectively. 



RESULTS 

Pharmacokinetics 

The results of modeling 9NC PK using the five-compartment model 
and the two-time constant model are shown in Figs. 5 and 6, respec- 
tively. The parameter values for the two models are given in Tables 1 
and 2. While the five-compartment description does an adequate job 
of capturing the time-course of the drug kinetics, it is clearly unable to 
capture the plateau in [9 AC] observed around t — 35 min. In contrast, 
the two-time constant model sums the fast and slow dynamics to provide 
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a superior dynamic model in the plateau region, with no loss in dynamic 
behavior elsewhere in the PK profile. 



Table 1. Five-compartment pharmacokinetic model parameters. 



Parameter 


Value 


Parameter 


Value 


Parameter 


Value 


Vl 


0.0138 ml 


&01 


0.2080 min -1 


ku 


0.0191 min' 1 


V3 


0.0021 ml 




0.0047 min -1 


&21 


1.069 min -1 






&13 


0.0135 min -1 


&34 


0.0182 min -1 






&3e 


0.8255 min -1 


kn 


2.240 min -1 



Table 2. Two-time constant PK model parameters 



Parameter 


Value 


Parameter 


Value 


»o 


0.57 


T01 


6.5 min 


«2 


0.55 


T02 


49 min 


#02 


36 min 


n 


14 min 


k c \ 


0.17 kg/ml 


T21 


75 min 


kc2 


0.025 kg/ml 







Pharmacodynamics 

Tumor growth over the first 50 days post-implantation is exponential 
in shape, as shown in Fig. 7. Two cohorts of 10 vehicle-treated mice had 
their tumors measured over a six-week span. These twenty mice were fit 
individually by an exponential curve, Eq. (3), by varying r. The mean 
value for the tumor doubling time is f = 11±2.3 days. This mean tumor 
growth value was fixed and used in all subsequent pharmacodynamic 
studies. 

To estimate the effects of 9NC and 9AC on tumor growth, the pa- 
rameters k^c, k,AC, and of Eq. (4) were fit to mice treated at a dose 
of 0.67 mg 9NC/kg body weight on the standard QDx5x2 every four 
weeks schedule. Concentrations of 9NC and 9AC were generated by sim- 
ulating the PK model in response to the dosing schedule. Unperturbed 
tumor growth was established by using the mean r value. Individual PD 
parameter estimates were calculated for each of 10 mice, with the mean 
values shown in Table 3. 

Using the mean values of the PD parameters, combined with the PK 
model of Eq. (2), the model simulation was compared with the 10 indi- 
vidual tumor growth profiles from the treated group. Initial conditions 
of the simulated tumor size were matched to the individual mice for com- 
parison. The results are shown in Fig. 8. On the whole, the simulated 
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Fig. 5. Pharmacokinetics of 9NC and 9AC in response to a dose of 0.67 mg/kg 
administered by oral gavage at time zero. The five-compartment model fit is shown 
by the solid line. Each point corresponds to the mean of 3 mouse measurements (error 
bars shown). 





Fig. 6. Pharmacokinetics of 9NC and 9AC in response to a dose of 0.67 mg/kg 
administered by oral gavage at time zero. The two-time constant model fit is shown 
by the solid line, and the fast and slow dynamic model contributions to 9NC PK 
are given by the dotted and dashed lines in the top figure, respectively. Each point 
corresponds to the mean of 3 mouse measurements (error bars shown). 
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Fig. 7. Exponential model fit (solid line) to tumor xenograft data (two studies, 
squares and circles, each representing the mean of ten mice). Plus and minus one 
standard deviation in f is also shown (dashed lines). 



Table 3. Pharmacodynamic model parameters (with the PK governed by the two- 
time constant model). 



Parameter 


Value 


Parameter 


Value 


fciVC 


0.0321 ± 0.0111 day -1 


0 d 


4.23 ± 2.75 days 


kAC 


0.0387 ± 0.0151 day -1 


T 


11 ±2.3 days 



model performed admirably in matching the observed tumor growth be- 
havior, including the ability of the drug to reduce tumor size ( 4 fc h plot 
in the left hand column of Fig. 8). 



DISCUSSION 

Pharmacokinetics 

A more quantitative comparison of the two models is possible using a 
metric such as Akaike’s Information Criterion (AIC) [17]: 
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0 20 40 60 0 20 40 60 

time (days) time (days) 



Fig. 8. PK/PD model predictions (solid line) of individual mouse data (circles) 
with dosing of 0.67 mg/kg on days indicated by +■ PK was governed by the two-time 
constant model; mean parameter values were used in the PD model. 



( S SE \ 

— ) -f- 2N params (5) 

TV points / 

The results of this comparison are shown in Table 4. A mechanical treat- 
ment of the mouse PK data from Figures 5 and 6 would result in the 
analysis that the two models provide roughly equal AIC values and that 
the sum squared error (SSE) of the two-time constant model is higher 
than the five compartment model by 54%. However, a qualitative ex- 
amination of the two figures demonstrates that the two-time constant 
model passes through all of the error bars in the 9NC and 9AC figures, 
while the 5-compartment model misses the 9NC point around t = 35 
min. Furthermore, an examination of the individual point errors (which 
are summed to form the SSE) notes that 89% of the sum-squared er- 
ror for the two-time constant model is contributed by the t = 5 min 
point; the time point whose measurement has the largest error bars. 
Given the steep slope of PK curves in general (and the present curves 
in specific) at early time points, a small left-to-right shift can induce 
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a dramatic decrease in model prediction error. As such, the AIC and 
SSE analysis were repeated without the t = 5 min point contribution. 
As shown at the bottom of Table 4 the two-time constant model now 
provides a markedly better AIC and a more than five-fold reduction in 
SSE. Based on the qualitative characteristics of the fit, as well as the 
quantitative performance at all but the least precise sample times, the 
two-time constant model is a superior representation of the data versus 
the 5-compartment structure. 



Table 4. Comparison of 5-compartment (5-Cpt.) and two-time constant (2 T.C.) 
models using AIC and sum squared error (SSE). Data were analyzed including and 
not including the 5-minute PK sample. 





5-Cpt. 


2 T.C. 


AIC (all points) 


49 


51 


SSE (all points) 


227 


350 


AIC (no t = 5 min) 


48 


31 


SSE (no t = 5 min) 


209 


39 



Another issue arises in the analysis of the PK time-course data of 
Fig. 6. The “slower path” model becomes active at t = 45 minutes. At 
this point, the 9NC concentration displays a sharp comer as the faster 
model contribution is falling while the slower model contribution is ris- 
ing. Clearly, this behavior is not physiological. Of greater importance, 
however, is the eventual use for the model. If exact prediction of PK 
behavior is necessary, the two-time constant model is insufficient. How- 
ever, if the goal is to obtain a control-relevant model suitable for use 
in a model-based design algorithm, the ability to capture the dynamic 
character of the plateau in the data may offer sufficient performance im- 
provement to overcome any degredation introduced by the nonsmooth 
PK profile. 

Pharmacodynamics 

The pharmacodynamic representation developed in this work is as de- 
tailed as possible given the available data. The fact that 9NC and 9AC 
act by topoisomerase I inhibition means that they act in the S(ynthesis) 
phase of the cell-cycle [18]. Hence, a cell-cycle model would allow a more 
focused description of the pharmacodynamics. However, the mathemat- 
ical representation of the cell-cycle is overparameterized compared to 
the available data, such that an infinite number of possible solutions 
exist for the unperturbed growth state. Due to this overparameteriza- 
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tion, the present study employed an exponential model. The resulting 
pharmacodynamic effect model is necessarily global (with respect to the 
tumor) rather than focused on a particular cell-cycle phase because of 
the lumped nature of the exponential growth model. 

Previous studies have employed the pharmacodynamic effect model 
used in this study [16,19]. Equation (4) is intuitive insofar as tumor cell 
death is proportional to both drug concentration and tumor size. The 
drug concentration contribution is easily understood, as an increase in 
concentration implies an increased drug effect (i.e. cell death). The tu- 
mor size contribution is related to larger tumors having a greater number 
of cells susceptible to the drug at a given time. Given a fixed proportion 
of cells in a given phase of the cell cycle, the increased number of cells in 
a target cycle leads to the increased effect of cycle-specific compounds 
such as 9NC for larger tumors. This is an empirical way to represent 
cycle- specific phenomena because the relatively faster doubling time of 
tumor cells means a greater number will pass through the “target” phase 
of the cell-cycle while the chemotherapeutic is at an effective level. While 
this PD effect structure isn’t necessarily descriptive of the mechanism 
of action of 9NC/9AC, it captures the effect sufficiently (Fig. 8) while 
introducing only 3 parameters. 

Treatment Schedule Design 

Models of PK and PD are useful themselves, but they can also be em- 
ployed in the design of therapy either a priori or in an ongoing manner. 
One approach to model-based treatment design could be to minimize the 
tumor size at the end of a finite horizon, to minimize drug usage, and to 
maintain patient quality of life by satisfying constraints on drug delivery 
including acute and cumulative maxima and delivery above a therapeu- 
tic threshold. Mathematically, this problem could be formulated as: 



min 

U(k) 



subject to: 



C D 

£>(<) + r «5>(j) 

t=i j - i 

U{k) > u th 

U{k) < Uac 

M 

'y ] U{m) < Ucujn 
m= 1 



(6) 



The objective is to minimize tumor size N(i) over a horizon of length 
C (days, weeks, etc.) plus the weighted total drug usage (the sum over 
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j). If total drug usage is unimportant, T u = 0 can be used, while 
expensive drugs might be dosed to maximize effect but minimize cost. 
This trade-off can be explicitly addressed by varying T u . The three 
constraints require individual drug doses to be greater than a therapeutic 
concentration (£/«/,), less than an acute maximum threshold (U ac ), and 
the sum of all doses to be less than a cumulative drug toxicity constraint 
(Ucum)- The result of this optimization is a drug treatment profile, U{k), 
satisfying the objective (6). 



SUMMARY 

Pharmacokinetic models of anticancer compounds are growing in 
prevalence, and these provide implicitly useful guidance in drug dosing. 
With the construction of pharmacodynamic models of unperturbed, and 
treated, tumor growth, as well as toxicity, it becomes possible to explic- 
itly employ PK/PD models to design cancer treatment schedules. This 
work has demonstrated how current modeling tools and insight can be 
integrated to develop mathematical descriptions of drug kinetics (PK) 
and effect (PD) that are quantitatively accurate while having a struc- 
ture that is amenable to model-based optimization. The present study 
was preclinical in nature, using human tumor xenografts, and the con- 
cept is generalizable for use in human patients with a modest increase 
in treatment-related measurements. A long-term goal of this work is 
to develop inter-species scaling factors to allow preclinical results, such 
as those in the present study, to be generally applicable in a priori 
dosage design for clinical treatment. Finally, based on the premise that 
model quality limits achievable performance [4], the development of mea- 
surement techniques or devices to sample at finer spatial and temporal 
resolutions could lead to more detailed PK and PD models and hence 
improved dosage design algorithms. 
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INTRODUCTION 

Since the work of Kruger-Thiemer [1] and Jelliffe et al. [2] illustrating 
the application of pharmacokinetic models to the design of drug dosage 
regimens, a number of pharmacokinetically-based approaches for indi- 
vidualizing drug doses have been developed and applied in the design 
of clinical trails and in drug therapy. The early work focused on open- 
loop control methods that use only a priori information about a drug’s 
pharmacokinetics in dosage regimen design. With advances in drug as- 
say technology, feedback control methods were proposed by Sheiner and 
colleagues [3] that allowed drug concentration measurements obtained in 
a subject to be used to develop subject-specific dosage regimen. These 
methods for population and feedback dose regimen design, developed 
and evaluated in the 1970’s (e.g., Sawchuk, Zaske and Cipolle [4]) have 
been refined and are applied in both clinical trial and pharmacothera- 
peutic settings. (See [5] for a review of some of this work.) 

The deterministic a priori and feedback control approaches to dose 
regimen design cited above have been extended to incorporate known 
uncertainties in drug pharmacokinetics utilizing a Bayesian framework 
(e.g., [6-18]). (In this volume, Rosner et al. report on an approach that 
utilizes a Bayesian decision theoretic framework for dose regimen de- 
sign.) These stochastic control approaches, while computationally more 
demanding than the widely used deterministic methods, provide an esti- 
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mate of the complete distribution of the control objective, as well as key 
system variables in the population and individual. Thus, for example, 
a dose regimen could be design to control the probability that the drug 
concentration is below, above or within some specified target window 
in the population (or an individual). Such dose regimen design objec- 
tives may have more relevance than deterministic objectives that ignore 
uncertainty in the drug’s pharmacokinetics. 

Stochastic control approaches can be also applied to dose regimen de- 
sign problems aimed at targeting pharmacodynamic end-points. Much 
of current-day drug discovery focuses on identifying new molecular tar- 
gets for drug therapy. This effort is motivated in-part through a better 
understanding of the genetic and molecular basis of disease and the 
ability to measure molecular pharmacodynamic biomarkers, or surogate 
end-points in vivo, which may better reflect the ultimate therapeutic 
or study outcome than do end-points based solely on drug concentra- 
tions in accessible tissues (pharmacokinetic targets). The challenges for 
dose regimen design for stochastic control of pharmacodynamic targets 
include more complicated models needed to relate drug dose to phar- 
macodynamic response then to drug concentration, and the additional 
variability associated with a drug’s pharmacodynamic processes, which 
is often greater than pharmacokinetic variability. 

Building on previous work by our group [19-20] and others [21], this 
chapter outlines a general framework for controlling the pharmacody- 
namic response of a drug in both the population and the individual. A 
general method is outlined that applies some of approaches and tech- 
niques developed for the stochastic control of purely kinetic systems 
introduced in the above referenced studies, including target interval ob- 
jectives and sampling-based methods for Bayesian computations. The 
feasibility of the approach is illustrated using a simulation example of 
the control of forced expiratory volume by the anti asthmatic drug terbu- 
taline. 



PHARMACODYNAMIC MODEL 

It is assumed that the pharmacokinetics and pharmacodynamics of 
the drug under study can be described as a continuous, dynamic system 
with discrete time measurements of the drug’s effect, as given in the 
following equations: 



dx(t ) 
dt 



f{x(t),a,r{t),t ) , 



x(0) — c 



(1) 
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Time 

Fig. 1 . General dose regimen illustrating intravenous, r(t), or bolus, b(t), drug 
administration. 



z{tj) = h{x(tj), a, tj ) + v(tj ) , j = 1, . . . (2) 

where the random parameter vector a has known density p(a), and 
the emors associated with the observations of the drug’s effect are in- 
dependent, zero mean, and Gaussian with know variance (i.e., v(tj) ~ 

iV(0, <’]))• 

The drug is administered either as a piece-wise constant infusion 
( r(t ) = r<_i, dtj-i < t < dti, i = 2,...) or as a bolus input which 
is assumed to change the system’s states instantaneously (i.e., x(dt+) = 
x(dt~)+b(dti), i = 1,...,). Figure 1 illustrates both of these drug ad- 
ministration regimens, where represents the dose times (i.e., times 
at which the infusion changes rate or a bolus is administered). For 
notational convenience y(a, t) is used to represent the model output 
h(x(t),a,t). 



STOCHASTIC CONTROL PROBLEM 

The stochastic approaches for controlling pharmacokinetic processes 
(drug concentrations) have included open-loop control methods [6,7,10], 
open-loop methods utilizing drug concentration measurements as feed- 
back [8,9,12,13,14,18], and closed-loop or active control methods [11,15, 
16,17]. Moreover, a variety of objective or loss functions have been 
used with these pharmacokinetic stochastic control approaches, includ- 
ing quadratic objective functions (e.g., minimize expected squared de- 
viation between target plasma concentration and model prediction) and 
target interval objective functions (e.g., maximize probability that drug 
concentration is within a pre-selected target interval). The particular 
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control strategy and objective function adopted depends on the objec- 
tive of drug administration (e.g., clinical drug trial or patient treatment), 
the specific clinical environment (e.g., clinical research center, intensive 
care unit, out-patient setting) and on the drug administered. 

The class of stochastic pharmacodynamic control problems considered 
herein is open-loop feedback control in which measurements of drug re- 
sponse obtained at one or more times during the course of the dosage 
regimen are used to design subsequent doses later in the regimen. The 
control scenario is divided into two phases, an a priori phase in which 
the initial dose regimen is designed using the prior information about the 
drug’s pharmacokinetics and pharmacodynamics (i.e., using p(a)) and a 
feedback phase (that may have multiple stages) in which the measured 
drug effect, z(tj), j = 1, . . ., is used together with the prior information, 
p(a), to design subsequent drug doses. The dose adjustment during the 
feedback phase can occur whenever one or more new drug effect mea- 
surements are available. The class of control objectives considered is 
target interval objectives in which the dose regimen is designed to max- 
imize the probability that the drug response is within a selected interval 
at the target time (see below). The computational methods introduced 
below to solve the target interval control problem can be readily applied 
to the dose regimen design problem with other loss functions, including 
quadratic control objectives. In the implementation presented below it 
is assumed that the control target times coincide with the dose times. 

The solution to the dose regimen design problem for the a priori 
control phase is given below for the bolus dosing problem (the solution 
for the infusion case follows readily): 



b* = arg{ max [ p(y(a, ti))dy(a, U)} , i = (3) 

The quantities L{ and U{ represent the lower and upper ranges of the 
drug effect target intervals at each of the target times, £*, in the a priori 
control phase. Calculation of the i th optimal dose, 6*, is conditional 

on all proceeding optimal doses, b*_ { , b*_ 2 , For a given dose, the 

distribution of the drug response at target time f*, p(y(a, £*)), depends 
on the known prior distribution of the model parameters p(a). 

Using measurements of drug effect obtained during the a priori control 
phase ( z = [z(ti), z(t m )] T ), the posterior density of the model pa- 

rameters, p(a\z), is calculated via Bayes’ theorem (p(a\z) = l(a\z)p(a)/ 
f l(a\z)p(a)da ), where the likelihood l(a\z) is readily obtained following 
the model definition in Eqs. (1) and (2). Given this posterior density, 
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the solution to the feedback control problem is (for the case of bolus 
inputs): 



rui 

b* = arg{ max / p(y(a, ti))dy(a, U) }, t = fc + 1, . . . (4) 

bi\b',j<i 

It is assumed that fc doses (6J, 6£) have been administered in the 

a priori control phase (fc > 1), and that the target times (ti, i = k + 
1,...) and target intervals (T7j = [L,, Ui\, i = fc + 1,...) are defined. 
If multiple feedback stages are to be considered, then Eq. (4) can be 
applied repeatedly together with the corresponding updated posterior 
density p{a\z). 



COMPUTATIONAL METHODS 

Calculation of the optimal dose regimen defined in Eqs. (3) and (4) 
requires solution of three computational problems: calculation of the 
density of the model effect at the target times; calculation of the integral 
of the effect density; and maximization of the integral over the admis- 
sible domain of doses. This latter problem involves a one-dimensional 
optimization, and as such is readily solved using a direct search or other 
suitable univariate optimization method. To determine the density of 
the model response, several sampling-based methods described below 
are used to generate samples from the drug effect densities. Using these 
samples, the necessary integrals in Eqs. (3) and (4) are then readily 
approximated via summations. 

For the a priori control problem, samples are generated from the re- 
quired densities needed to solve Eq. (3) by first drawing samples aj 
from the known prior density p(a) and then simulating the correspond- 
ing sample y(aj, U), from p(y(at, U )). The integral in Eq. (3) is then 
approximated as follows: 

f P(y(«- U))dy(a, U) “ 1 / N <* ( 5 ) 

Jhi rv{aj,U)£Tl t 

where Qj, j = 1, N a , are independent samples from p(a). We 
discuss below the issue of sample size, N a , and the accuracy of the 
solution of the corresponding control problem. Monte Carlo integration 
as used above (specifically, the sample-mean Monte Carlo method [22]) 
is applicable to the a priori control problem, since independent samples 
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can be drawn from the drug effect density during the a priori control 
phase. 

One can also solve the feedback phase control problem in Eq. (4), 
given samples from the posterior parameter density p(oc\z) (samples from 
the effect density can then be obtained by simulating the model for a 
given dose regimen). Two methods for generating samples from the 
posteriori density p{pt\z) are implemented: the acceptance-rejection (A- 
R) sampling method and a Markov Chain Monte Carlo (MCMC) method 
based on the Metropolis-Hastings (M-H) algorithm. 

Acceptance-Rejection (A-R) Method 

A number of methods are available for directly generating independent 
samples from the posterior density p(a\z) (also referred to as the target 
density) given samples from the prior density p(a) (also referred to as the 
proposal density). The A-R sampling method is one such method that is 
also relatively easy to implement (see [22-24]), and has been applied to 
pharmacokinetic stochastic control problems (e.g., [13,14]). To generate 
samples from p(a\z) defined above, the following implementation of the 
A-R method is used: 

1. generate a u i = 1, TV, independent samples from p(a) 

2. approximate max{T(a|z)} by M = arg{maxT(at|z)} over all c*i 

3. generate u t) i — 1, . . . , TV, independent samples from U(0,1) 

4. for i— 1, . . . , TV; if rtj < l(ati\z)/M, then accept a* 

The samples generated from this A-R algorithm, ay, j — 1, . . . , Nf 
( Nf < TV), are samples from p(a|z) that then are used to solve the 
control problem in Eq. (4) as outlined above. The appropriate selection 
of N is critical to the algorithm’s performance. For a specified level of 
accuracy in the control calculation, the selection of N will depend on the 
particular data and resulting likelihood function, with the efficiency of 
the algorithm (TV/TV/) decreasing as the posterior density deviates from 
the prior (see, for example, [14]). Variations of the basic A-R algorithm 
as well as other related sampling methods have been proposed with the 
aim of improving the efficiency of the A-R method described above (e.g., 
ratio-of-uniform method [14], adaptive A-R sampling [25]). 

Metropolis-Hastings (M-H) Algorithm 

A Markov chain Monte Carlo (MCMC) method is also used to solve 
the feedback stochastic control given in Eq. (4). For this purpose the 
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following Metropolis-Hastings algorithm as suggested by Tierney is used 
[26] (independence Metropolis-Hastings algorithm IMH): 

1. select aoi and initialize the chain at ao 

2. for i = 0, . . . , Nf 

3. generate y fromp(a) 

4. generate*/ fromU(0,l) 

5. if u < min{/( 2 /| 2 )//(aj| 2 ), 1} then aj+i = y, else a<+i = a* 

After the chain has converged, the samples generated from the algo- 
rithm are samples from p{a\z). These samples are then used to solve the 
control problem in Eq. (4), as described above for the a priori control 
phase. It should be noted that the Markov chain in the IMH algorithm 
will generally have periods where the chain does not move (eq+i = a t ), 
and the efficiency of the control algorithm can be improved by exploiting 
this local stationarity of the chain. 



EXAMPLE: CONTROL OF FORCED 
EXPIRATORY VOLUME BY THE 
ANTI-ASTHMATIC DRUG TERBUTALINE 

Pharmacokinetic/Pharmacodynamic Model 

The stochastic control framework outlined above is illustrated using 
a pharmacokinetic/pharmacodynamic model of the drug Terbutaline. 
Terbutaline is a fa agonist that can be administered orally or subcu- 
taneously for the treatment of reversible airway obstruction. Several 
studies have demonstrated the relationship between plasma concentra- 
tion of fa agonists, including Terbutaline, and the drugs’ bronchodila- 
tor effects, for example, as measured by forced expiratory volume at 
one second ( FEV \ ) [27]. These studies have also revealed considerable 
variability in the therapeutic and toxic response of patients for a given 
plasma concentration of Terbutaline. It is therefore more appropriate to 
target the therapeutic response and adverse effects of fa agonist such as 
Terbutaline, rather than controlling plasma concentration per se. Ac- 
cordingly, Terbutaline represents a good example for the application of 
the pharmacodynamic stochastic control framework introduced above. 

A model for the simultaneous pharmacokinetics and pharmacodynam- 
ics (FEV\ as the drug response) of Terbutaline has been reported by 
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Table 1 . Mean Values of Model Parameters from the Prior Density. 



K a 

(hr -1 ) 


Kc P 

(hr- 1 ) 


K pc 

(hr’ 1 ) 


K e l 

(hr- 1 ) 


V 

(L) 


Kco 

(hr- 1 ) 


EC 50 
(ng/ml) 


Emax 

(L) 


£0 

(L) 


3.89 


0.44 


0.52 


0.44 


49.8 


5.50 


2.33 


3.76 


2.01 



Oosterhuis et al. [27], and is illustrated in Fig. 2. The pharmacokinet- 
ics of the dug are described by a three compartment linear model that 
includes a first-order absorption process for the subcutaneous uptake of 
Terbutaline (b ( t ) represents the bolus dose regimen). In the model Ka is 
the absorption rate constant, Kcp and K pc are the inter-compartmental 
rate constants, K e i is the elimination rate constant, and V is the central 
compartment distribution volume. The relationship between the plasma 
concentration of Terbutaline and FEV\ is modeled through a drug ef- 
fect site which in turn is related to FEV\ by a sigmoid response function 
where the three dynamic parameters, E max , EO and EC50, denote the 
maximum drug effect, the baseline effect, and the effect site concentra- 
tion producing 50% of E max , respectively. In the model equations given 
below, aq ( t ), £2 ( t ), and £3 ( t ) represent the amount of drug in compart- 
ments 1, 2, and 3, respectively, while £4 ( t ) is the concentration of drug 
in the effect site and y ( t ) is the drug response FEV\. 



dx\(t ) 
dt 

dxi(t) 

dt 

dx 3 (t ) 
dt 

dx\{t) 

dt 

y(a, t ) 



= -(Kel + Kcp)Xi{t) + K a X-2(t) -I- KpcX 3 (t) 
= -K a x 2 (t) 



KcpXi(t) - K pc x 3 (t) 
K eo (^--x 4 (t)) 

Fft 4- ^ max ~ ^21 -r (71 

+ EC50 + x 4 (t) 4 ® 



(6) 



The model parameters (a = [K a K cp K pc K e i V c K e0 EC 50 £ max £'0] T ) 
are assumed to be distributed lognormally (a ~ LN(fi t Q)) with popu- 
lation mean /r and covariance fl as reported in [27] and given in Tables 
1 and 2 respectively. 
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Fig. 2. Pharmacokinetic /pharmacodynamic model for Terbutaline. 



Dose Regimen and Control Objectives 

The particular dosing scenario selected for the example is illustrated 
in Fig. 3. The a priori control phase involves determination of a sin- 
gle bolus dose of drug, 6i administered at time 0.0, while the feed- 
back control phase has two stages in which doses 62 (administered at 
(= 24.0 hr) and 63 (administered at (= 48.0 hr) are calculated. For the 
a priori control phase the target time =8.0 hr and the corresponding 
lower (Li) and upper (Ui) values of the target interval for the drug's 
response (FEV i) are L 1= 2.0 L and U\=2.5 L. Following administra- 
tion of the optimal a priori dose b\ (obtained from Eq. (3)) in a given 
subject, three measurements of FEV\ are assumed to be available at 
1.0, 8.0 and 23 hr (2 = [2(1) 2(8) z( 23 )] r ), as illustrated in Fig. 

3. The first feedback control dose is then calculated (Eq. (4)) with 
target time ( 2 — 32.0 hr, and Z,2=2.0 L and (72=2.5 L. After adminis- 
tration of 62 in the subject, three additional measurements of FEV j 
are obtained during the second dose interval (at 25.0, 32.0 and 47.0 hr) 
and used to calculate the next feedback control dose 63 from Eq. (4) 
(( 3 = 56.0 hr, £3=2.0 L and (73=2.5 L, with the vector of observations 
2 = [2(1) 2(8) 2 ( 23 ) 2 ( 25 ) 2 ( 32 ) z(47)] T ). This process could con- 
tinue regardless of whether feedback measurements of FEV \ available 
from a particular dose interval. 
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Table 2. Covariances of Model Parameters from the Prior Density. 





K a 


K cp 


K pc 


Kd 


k 




(hr- 1 ) 


(hr- 1 ) 


(hr" 1 ) 


(hr" 1 ) 


(L) 


K a 


7.40 










Kcp 


-0.40 


0.045 








Kp C 


-0.18 


0.007 


0.037 






K el 


-0.24 


0.018 


-0.009 


0.030 




k 


3.52 


-1.68 


1.55 


-2.73 


404.9 


K eo 


1.59 


-0.28 


0.23 


-0.040 


15.2 


EC50 


-0.34 


0.15 


-0.047 


0.037 


-6.64 


Emax 


0.13 


0.051 


0.008 


-0.094 


8.76 


E0 


0.36 


0.020 


-0.057 


-0.087 


6.72 





Ka 

(hr- 1 ) 


K C p 

(hr- 1 ) 


K pc 

(hr" 1 ) 


Kel 

(hr" 1 ) 


Ka 










K cp 










Kpc 










Kd 










V 










Kco 


7.87 








EC50 


-4.42 


3.06 






Emax 


-1.14 


0.77 


1.09 




EO 


-1.82 


0.98 


0.72 


0.91 



Population Simulation of the Control Algorithm 

To assess the performance of the proposed framework for the control 
of FEV\ a population simulation is implemented. The optimal a priori 
control dose, b\, is used to simulate the FEV\ response of 500 randomly 
selected simulated subjects from the prior parameter distribution. The 
random vectors of the model parameters are generated from the popu- 
lation (prior) multivariate lognormal density as described in [28], (For 
the example considered here, the log transform of the original subject 
parameter estimates from [27] were used to define the transformed multi- 
variate Normal density of model parameters from which random samples 
were obtained.) Given the target interval selected, [2.0L, 2.5L], samples 
generated from the priori density are excluded if E0 >2.5 L or E mSLX <2.0 
L. To simulate the feedback measurements for a given simulated subject, 
Normally distributed measurement error (mean 0.0 and standard devi- 
ation equal to 10% of the exact simulated model response) is added to 
the simulated FEV\ response at each measurement time. 
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Time (hours) 



Fig. 3. Dose regimen scenario for Terbutaline. 



RESULTS 

A Priori Control Phase-Population 

Solution of Eq. (3) for the Terbutaline example yields an optimal dose 
of 6J=0.50 mg for the population. Given this dose, the resulting distri- 
bution of FEV\ at the target time fi=8.0 hr, p(FEV\(t\) is displayed 
in Fig. 4. This density was constructed by applying a piece-wise linear 
smoother to the samples obtained from p(F EV\(t\) that were gener- 
ated as part of the solution to the a priori control problem as presented 
above. (Only admissible samples from the prior density were used to 
construct samples from p(FEV\(t\)) (i.e., £0 >2.5 Land E m ax <2.0L).) 
The number of samples N a was determined empirically (via repeated 
solutions of Eq. (3) with different values of N a ) so as to yield two dig- 
its of accuracy in the resulting optimal dose calculation ( N a — 20, 000). 
The mean and standard deviation of the distribution of FEV\ shown in 
Fig. 4 are 2.3 L and 0.58L. Of more interest, given the control objec- 
tive, the following target interval and related probabilities are calculated: 
Prob(2.0L < FEVi(t x ) < 2.5 L) = 0.33; Prob{FEV x (t i ) < 2.0L) = 0.32; 
and Prob{FEV\{t\) > 2.5 L) = 0.32. 

The 95% highest density interval (h.d.i.) for FEV\(ti) is calculated as 
the smallest FEV \{h) interval containing 95% of the density of 
p{FEV l (*0). The resulting 95% h.d.i. calculated for FEV\(t\) is [1.3L, 
3.4L]. From the calculated density for FEVi(t\) given the optimal dose 
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p(FEV 1 (t=8) | b 1 ) 




Fig. 4. The resulting distribution of FEV1 at the target time ti=8.0 hr 
(p(FEVi(ti)) given the optimal dose b{ = 0.50 mg. 

&*, other relevant quantities can be calculated such as the probability 
that FEV\(ti) is less than some specified critical value ( Prob(FEV\(t\ ) < 
Critical feVi))- This calculation then provides an estimate of the pro- 
portion of the population that does not respond adequately (as defined 
by CriticalFEVi) to the dose of Terbutaline that is optimal for the entire 
population. 

Feedback Control Phase - Selected Subject 

To more fully illustrate the pharmacodynamic stochastic control frame- 
work proposed above, the results from a single simulated subject are 
presented in detail. A simulated subject was selected for which the 
population optimal dose b\ produced a value of FEV\(t\) considerably 
below the target interval. The parameter values for the selected sub- 
ject are K a = 2.2, K cp = 0 . 12 , K pc = 0.62, K e i = 0.49, V — 34.7, 
K eo = 9.2, EC50 = 1.32, E max = 2.7, EQ = 0.65 with units given in 
Table 1. The resulting simulated measurements of FEV\ at 1, 8 and 
23 hr. ( 2 ( 1 ) = 2.5, 2 ( 8 ) = 1.24 and 2(23) = 0.64L) were then used 
to solve the feedback control problem for b* z (Eq. (4)). Using the A-R 
method, the resulting solution to Eq. (4) was = 3.8mg, which in 
turn was used to simulate measurements of FEV \ at 25, 32 and 47hr 
(2 (25) = 2.76, 2 (32) = 2.10 and 2 (47) = 0.71L). The complete set of six 
simulated measurements of FEV j was then used to solve Eq. (4) for the 
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Simulated Subject 




Fig. 5. The densities of FEVi for a selected subject at each of the three target 
times, p(FEVi(t\)) (same as Fig. 4), p(FEV\(ti)) and p(FEVi(t 3 )), for the optimal 
dose regimen, b\ = 0.5 mg, = 3.8 mg and 63 = 3.7 mg. 



second feedback stage resulting in 63 = 3.7mg. A value of N = 5000, 000 
was used for both feedback control stage calculations and selected to 
achieve two digits of accuracy in each calculated optimal dose, as de- 
scribed above. For this subject, the number of accepted samples in the 
first feedback control stage (calculation of 6$) was Nj = 1,096, while for 
the second feedback stage (calculation of 63) Nf = 244. 

Figure 5 shows the densities of FEVi for this subject at each of the 
three target times, p(FEV\(t\)) (same as Fig. 4), p(FEV\{t 2 )) and 
p(FEV\{tz)), for the optimal dose regimen, b\ = 0.6 mg, b* 2 = 3.8 mg 
and 63 = 3.7mg. The probability that FEVi is in the [2.0L, 2.5L] target 
interval increased from 0.33, given the a priori dose, to 0.81 following 
the first feedback dose, and 0.95 following the second feedback dose. As 
noted above for the a priori control phase, samples from the densities 
p(FEVi(t 2 = 32)) and p(FEVi(t$ = 56)) can be used to calculate the 
95% h.p.d. interval for each distribution: the 95% h.p.d. interval cal- 
culated for p(FEV\{t 2 — 32)) is [1.9L, 2.7L] and for p(FEV\(t$ = 56)) 
is [1.95L, 2.45L]. The control calculations have also been repeated for 
the selected individual using the IMH algorithm (77=5000.000 for both 
stages), resulting in b 2 = 3.7mg and 63 = 3.9mg with very similar target 
interval probabilities to those obtained using the A-R method. 
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It is also instructive to examine the marginal densities for the model 
parameters obtained using the samples generated from the posterior dis- 
tributions for each feedback stage. Figure 6 shows the marginal densi- 
ties for four of the model parameters, two parameters from the kinetic 
portion of the model (K a and V) and two from the pharmacodynamic 
model (EC 50 and E max ). For each parameter, Fig. 6 displays the 
computed marginal density from the posterior distribution calculated at 
the first feedback stage (p(K a \z), p(V\z), p(EC5Q\z), p(E max \z), z = 
[z (1) 2 (8) 2 (23)] T ) and at the second feedback stage (p(K a \z'), p(V\z'), 
p(ECm\z'), p(E max \z'), z' = [z (1) 2 (8) z (23) 2 (25) 2 (32) 2 (47)] r ). Also 
shown for each parameter is the marginal density of the prior distribu- 
tion (obtained from the samples generated to solve the a priori control 
problem). These computed marginal densities provide insight into the 
amount of information in the FEV 1 measurements about the underlying 
model parameters. 

Population Simulation Results 

A histogram of the values of FEV\ (t\ = 8.0) obtained from 500 sim- 
ulated subjects given b\ = 0.5 mg is shown in Fig. 7. The computed 
density p(FEV j (t\ = 8.0)) (from Fig. 4) is also superimposed on the 
histogram in Fig. 7. Table 3 compares the predictions, including the 
target interval probability obtained from the computed density to the 
results obtained from the simulations. Also shown in Table 3 is the 
percent of the 500 simulated subjects for which the simulated value of 
FEV 1 (<1 = 8.0) was within the computed 95% h.p.d. interval (i.e., per- 
cent coverage of the h.p.d. interval). 



Table 3. Comparison of Calculations from the Computed Density, p(FEVi(ti = 
8.0)), and from the Simulations for the a Priori Control Phase. 





Computed Density % 


Simulation % 


Prob(2.0L < FEViih) < 2.5 L) 


33 


36 


Prob(FEVi{ti) < 2.0 L) 


32 


30 


Prob(F EV\{t\) > 2.5 L) 


32 


34 


% Coverage of 95% h.d.i. 


- 


96 



The results from the complete feedback simulation of the 500 subjects 
are shown in Fig. 8, including the histograms of the simulated responses 
FEV 1 (t\ = 32.0) andFEVi (t\ = 56.0) (the histogram of FEV 1 (ti = 8.0) 
is show again for comparison). Table 4 is a summary of the simulation 
results, showing the percent of the subjects with FEV 1 values within the 
target interval at each of the target times as well as the percent coverage 
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(a) (b) 




(c) 



(d) 



Fig. 6. Marginal densities of selected model parameters for the sample subject. 



of each simulated subjects h.p.d. interval (i.e., the percent of subjects 
for which the calculated 95% h.d.i. for FEV1 contains the actual value 
of FEVi). 

The number of samples used in the A-R and IMH algorithms for calcu- 
lation of the optimal feedback doses was again determined (empirically) 
to produce at least two digits of accuracy in the resulting doses b 2 and 
63 in all simulated subjects. This required repeating the population sim- 
ulation with the 500 simulated subjects for different values for N (in the 
A-R algorithm) and Nf (in the IMH algorithm). For convenience, the 
same number of samples is used for both feedback stages. The results 
shown in Fig. 7 and 8 and Table 3 and 4 were obtained using the A-R 
algorithm and were similar to those obtained using the IMH sampler. 
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FEV 1 (t=8) (Liters) 

Density (%) Simulation (%) 



£ 

u. 



FEV1 in Tl 


33 


36 


FEV1 < L 


32 


30 


FEV1 > U 


35 


34 


% Coverage of 95% h.d.i. 


= 96% 



Fig. 7. A histogram of the values of FEV\(t\ = 8.0) obtained from the 500 
simulated subjects given 6J = 0.5 mg. 



Table 4. Summary of the Simulation Results for the Target Interval Probability, 
Prob(‘2.0L < FEVi(t) < 2.5L), for each Target Time. 



Target Time 


Simulation % 


% Coverage of 95% h.p.d. 


ti = 8.0 


36 


96 


t 2 = 32.0 


80 


93 


t 3 = 56.0 


95 


94 



DISCUSSION 

Pharmacokinetic principles have been used for many years to guide 
the design of dose regimens in drug development clinical trials and indi- 
vidualized drug therapy. Several methods and applications of pharmaco- 
kinetically-guided dose regimen design have been reported that formally 
incorporate information about the uncertainty in the drug’s pharma- 
cokinetics in designing a dosing regimen. As illustrated in this chapter, 
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Simulation of Control Process (500 subjects) 




FEVj (Liters) 





FEV 1 in Tl (%) 


% Coverage of 95% h.d.i. 


t = 8 hr 


36% 


96% 


t = 32 hr 


80% 


91% 


t = 56 hr 


95% 


93% 



Fig. 8. Histograms of the simulated responses FEV\{t\ = 32.0) and FEV\(t\ = 
56.0) (the histogram of FEV\(t,\ = 8.0) is show again for comparison) from the 
complete feedback simulation of the 500 subjects. 



these stochastic control methods can be applied to problems involving 
dose regimen design for targeting pharmacodynamic end-points for use 
in effect-controlled clinical drug trials and in therapy. Given the current 
focus on identifying new molecular targets in drug discovery, the use 
pharmacodynamic biomarkers and surrogate end-points in drug devel- 
opment clinical trials motivate the further development of methods for 
pharmacodynamic stochastic control. 
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